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Definition, Scope and Goals of Econometrics 


\1. Definition and Scope of Econometrics 

Literally speaking, the word ‘econometrcs® means “measurement in economic 
Econometrics may be considered as the integration of economics, mathematics and 
Satistis forthe purpose of providing sumerial values fr the parameters of economic 
relationships and verifving economic theories. Iisa special type of economic analysis 
in which the general economic theory formulated in mathematical terms is combined 
‘with empirical measurement of economic phenomena. We start from general economic 
theory, that i, from the relationships of economic variables as suggested by econornic 
‘theory and express them in mathematical terms. This is called building of an economic 
model. Next we use statistical methods in order to obtain numerical estimates of the 
coefficients of the economic relationships. These statistical methods are called 
econometric methods. 

‘Although measurement is an important part of econometrics, the scope of 
‘ssonometrics is much broader, as can be seen from the following quotations 

“Econometrics, the result ofa certain outlook on the role of economics, consists 
‘of the application of mathematical statistic to economic data to lend empirical support 
to the models constructed by mathematical economics and to obtain numerical results! 

“Econometrics may be defined as the quantitative analysis of actual economic 
phenomena based on the concurrent development of theory and observation, related 
by appropriate methods of inference.”? 
| Beonometrics may be defined as the socal science im which the tols of econo 

‘mathematics and statistical inference are applied to the analysis of econo 


“Beonomeris is concerned with the empirical determination of econornic laws." 
“The art of the econometrician consists in finding the set of assumptions that ate 
both sufficiently specific and sufficiently realistic to allow him to take the best possible 
‘of the data available to him’ 
“‘Bconometrcians .. are of positive help in trying to dispel the poor public image 
(quantitative o otherwise) asx subject in which empty boxes are opetied 


‘of Chicago Press, Chicago, 1968, p_ 74. 
ee Serres N.S, “tees fe Enlmive 
2" Beonometrica, vol. 22, vo. 2, April 1954, pp. 141-186, 
‘Feonomeiric Theors, John Wiley & Sons, New York, 1964, p. 
af Feo Joba, Wiley & Sons. New Yok, 1971, | 
‘of Econometrics, Rand Mc Nally, Chicago, 1966, P. S14 
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‘by assuming the eterce uf ca-opener 0 Pees cles which ay fen ecoromits 
el terre Tap" 

"The tho of ecosumeb teers ams, etl ala cajun af econo 
‘neo ae acta scaurerentn eg the ery and evga tall fren 
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me (Ft dei we Scion 22.1) 


between Econometrics and Economic Theory 
kes semen or hypotheses that are ansly qualitative in 


Pe rE ee CTT nN 


DEFINITION, SCOPE AND GOALS OF BOONOMETRICS 3 


The proposition suggested by economic theory \4 to be tested now, applying 
econometric meth we fil tha the theory He consistent with the empirical revulla 
‘we accept the theory but If we find that wo consistent with the empirical revue, 
thes we have either to reject the theury orto modify he theory. If we like 10 modify 
‘the theory then we should not reject the theary, rather we should renporate seme 
‘other variabley and parameters $0 make the theury more meaningful (4nd clove to 
reality) 

or example, he smple consuytion incune relation, C= a + HY ~ a canbe modified 
1 the form 

Cora + bY > cP + dW + u where two new varibles P (price level} and M (eat) 
Ive been taken into account inthe functional reltion. The signe of the parameters 
(a, be. > 0) and the comesponding response coetTicients can sl be tested 
epic 

11.2.1, Difference between Economic Model and Econometric Model 

‘A model js simplified representation of a real world process. In practice, im any 
‘economic model (say consumption function et demand function), we can ice all the 
relevant variables that we tiak are relevant for our purpose and dump the rest ofthe 
‘variables in basket called "dsturbance” This brings us tothe distinction between 
‘an scanomic model and un econometric model 

‘An s20nomic model is x set of assumptions that approximately describe the 
behaviour of an economy. Ar econometric model, on the other hand, consists of the 


(A vet of behavioural equations derived from the economic model 
i) A statement of whether there are errs of observation in the observed 


‘variables 
Ail) A speciation ofthe probability dvcibetion of he “sturbances "(and erory 
__ of measuremen), 


example, we may consider a simple deniand mode! of evonomics, Then 
‘mode! will usally consist of: 

The behavioural euaton =~ a+ ip > where g quantity demanded, » = 

‘and are two parameters end v ~ random disturbance term. 

‘A specification of probability distribution of u, where valuos of u are 
ad normally distibted with mean F (u)=0 and variance (x) 0}, With 


‘economies employs mathematical symbolism. Bosh express the Vanious 
Tina exact form. Nevtaer economic theory sor mathematical 

for random elemens which might affect the relationship and make i 
‘they do not provide mumercal values forthe cocciets of 

in economic theory oF i mushemstical cconomics are of 
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Although ecanometncs presopposs the expression of economic relationships 
‘mathematical frm, ke mathematical economics it does not assume thst eeot0mic 
‘elationshipe are exact. On the contrary. conometrics assumes that relationships ar 
‘ot exact. Econometric methods are designed 0 take ino account random disturbance 
Weick create deviations fom the exact Behavioural pllems suggested by econom 
theory and ntheraical economics Furbermere, econometric methods provide 
‘mumerical values of the coefficients of economic phenomena. Thus by combining 
‘mathematical formolations of theory with empirical dats, comometrics enables us 
se fm the abact theoretical scheme 19 numerical reslts in EOREFte ses 


14, Econometrics and Statistics 

-Econometics difers both from muthemtical statistics end econnanio statistics An 
conomic statitician gathers empirical data, records them, tabulaes them oF chars 
‘hem, and then alerts to descbe the putiem in their development over tine ans 
_pethaps detect some relationship between various cconomic magnitudes, This 


esonomie lif, These adopted statistical methods are called cermometic metho, 1p 
Particular, economitric methods ure so adjusted that they become appropriate for the 
Ieisuremient Of économie relationships which are stochastic, that iy they clude 
‘random elements. The adjustment consists primarily in specifying the stochuntc 
Ganda) slenees a re npposed operat he ral ord weet The 

ofthe observed dit, so hatte later can be interpreted As (randon) 
‘sample 10 which the methods of satitics can be applied 


15, Goals of Econometrics 


aking : The namericaleximacs of the coefficients of the econo) 
help the plieysmaker io define the appuprinte policies For example, nc 


: f mumercal estimates of price elasticities of 
Pibde weno ow tow the eration as + policy wil be 


SY 
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Successful econometric applications should really snchude some combination of all 
‘those aims, 


1.6. Division of Econometrics 
Bconometrics may be divided into two branches —theotetical econometrics and 
applied econometrics 

Theoretical econometrics includes the development of appropriate mets forthe 
‘measurement of economic relationships. Peonometric techniques are basicaly statistical 
techniques which have been adopted to the particular characteristics of economic 
relationships. 

‘Econometric methods may he classified into two grovps . (i) Single-equation 
techniques, which are methods that are applied to one relationship at time, and (i) 
Simultaneous equation tecniques, which are methods applied to all the rlatonsbips 
‘of a model simultaneously. 

‘Applied econometrics includes the applications of evoncmeteic methods to specific 
‘branches of economic theory. It examines the problems encountered and the findings 
‘of applied reasearch in the fields of demand, supply, production, investment, 
consumption, and other sectors of economic theory. Applied econometrics involves the 
application of the tools of theoretical econometrics for the analysis of economic 
‘phenomens and forecasting economic bebaviou 
1. Methodology/Stages of Econometric Research 

Appi cman ac sconce vit te meanest ofthe prates 
Aeon eine und wits he petcn of he vals of economic varubes 

‘The relationships of economic theory which can be measured with one or another 
‘econometric techniques are causal, thai hey ace relatiaships i which some val 

ostulated a8 causes ofthe variation of other variables. In this sense definitional 
donot require any measuecment. For example, the equation P= C¥ / is the 
expression ofthe definition of national income (i closed econam, ith 
‘ctvity) of economic theory. It does not explain the determination of 


‘considera production function ofthe following type Y=/(K, 1) where 
are the two factors of production. 
= Capital, L = Labour and ¥ is the level of output. This function can also be 
the Cobb-Douglas form ic, Y= K* 1? or, log Y= a log K ~ fh log L. This 
form of lg linear function. Here some theoretical restrictions must 
<a B< 1, 
1 if there are increasing returns to scale 
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Aha variabes ofthe fcr 
‘among the explanatory variables 
= BY + ByW > 4 where 
‘ariable) and explanatory varialen a? 
evel Naw we have o find ut wheter tere exe 
variables (i.e. wrelatin between Yand 0" Y wn 
inert) 
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the wandard errors of the parameter estimates. They help us to establish whether the 

‘extimater have the desirable properties af nbvanednens, comsstency 6 
‘Stage D : Evaluation of the forecasting power of the estimated model 
‘The objective of any evomnometn rrsear it obtain god numerical estimates of 

‘the coetMcients of economic felthonshipn ant ruse them forthe pretiction of the 

‘yale of economic variables Estimates ofthe parameters are useful caus these help 

the policy makers in adopting the pices. A model fier the estimation ofthe parameters 

‘an be wred in forecasting the values of the economic variables. So the ecometticiany 

‘wuet Yet the forecasting power of the model 

18, Desirable Properties of an Econometric Model 
‘The ‘goodness’ of an econometric made! is judged customarily secording to the 

{ollowing desirable properties 

(Theoretical plausbitty : The model should be compatible with the pontlates 
cf economic theory. must describe adequately the ecinemic phenomens ts 
which i relates, 

{Explanatory ability : The mode! should be able o explain the observations uf 
the actual world. It must be consistent with the observed behaviour of the 
‘economic variables whose relationaip it determines. 

il) Accuracy of the estimates of the parameters The exter of the coetfeten™ 
should be accurate in the sense that they sbould approximate a» bev! o 
pomsible the trve parameters of the strctural mode! 

Gh) Forecasting ability : The model should produce satistactory prediction» of 

future values of the dependent variables 

(0) Simplicity : The model should represent the economic relationships with 
‘aximar simplicity. 
and Sources of Data for Economic Analysis 
suceess of any econometric analysis depends on the availability of the 


time series is a set of observations on the values that a variable takes af different 
“Such data may be collected at regular time intervals, such as daly (eg. stock 
‘weather reports), weelly (eg money supply fies), monihly (ew. 
‘rate, Consumer Price Index (CPI), quarterly (¢.g.. GDP), amually (ee 
‘budger), quinguennially, that is, every S years (ep. the census of 

‘0 deconnially, that is every 10 yeas (e.g. the census of population). 


ata are data on one or more variables collected atthe same point 

ts the census of population conducted by the Government of India every 

the Survey of household consuuner expenditure in Inia conducted by National 

‘Organization (NSSO), the opinion polls by the Times of India, NDTV, 

and many other organizations. An individual researcher or a group may so 
data directly from the field of enquiry 
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Conventionally the leer ¥ denotes the dependent variable und 18 (Xi X ., 
denote the explanatoryiindependent variables, being the Ki explntory vara 
The subscript or denote h rte 1th observation ovale. Xj (OFX) il dens 
| the wh (ore) observation on vale Xp Her (or 7) il dene te tt 
of observations or values in the population and n (orf) will denote the ttl un, 
| of sbeerations ina sample. Norell the sobcrp wil be used fr erstsetion ds, 
(ie, data collected at one pot of ime) andthe aubsript wil e used for time ser 
dia ie, data collec om diferent periods of tine). For instance, consider, 
Keynesian consumption function ofthe form C ~ a ~ BY where C = consumption 
expendtae, ¥ ~ income and a und b are two constants, @ = autonomous part» 
Consumption expenditure, b = marginal propensity 10 consume. According {0 th 
‘existing theory a > 0, 0 < b < I. If we like to test this relation with the help of timc 
series data, then we wil write the regression equation in the form C, = a + bY, + u 
(where = 1,2, (89) where ws the random distrbance term. On the oer ban 
‘Wwe ean write the regression equation inthe form C,= a ~ BY, + tp f= 1, 2, sm (sty), 
hen we verify the relation with the elp of crot-scton Ut 
Pooled Data 
Pooled or combined data are elements of both time series and cross-section data 
‘Gcnerally speaking. poled data is combination of data (1.2 sles, adverinemen 
‘eurnings ete) of say 20 firs over a given period of time say, a year or two. These 
| ‘combined data of 20 firms in 2 years making 40 observations make ‘ita pooled data, that 
‘5, pooling 20 fimss’ dats in 2 years together. So, itis « combination of cross section 
‘data und tiene series data. 
Pane ang or Wiopane! Dat 
Mbt So ceil ‘of pooled data in cross-sectional unit (say 
1 tal ec) ote ner ng Ae THe wi oy 
For example, 


‘financial ‘Periodically (10 years interval), data can wide 
‘sci inoraton onthe changes that may have accared fn then nee 


data in 9 short period of time, 

pot atte Resi difference berwsen poled data and pane! daa. I should be 

fercl ir orlenratrrgere dats are data with relatively few cross. 
enim ariables arc held in cross-section specific 


Oteanization), an intenational Ce. : 
Lay ne arta ya 
‘data for one purpose or santas Na aL THe exist ml of ugeacies colleen, 


Aitemet bas revolutionized dats 


EE 
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nering. Most ofthe data can be downloaded from different websites ether free of 
owt or with minimum cost. 
‘The Accuracy of Data F 
Although plenty of data are wviable for economic research he quality of datnin | 
‘often not that good. } 
"There are several reasons for that 


(@ Most of the social science data are non experimental in nature Therefore there 
{is the possibility of observational errors 
@ 


Even in experimentally collected data, errors of measurement arise from 

approximations and rounding offs 

(iil) In questionnaire type of surveys, the problem of non-response may lead \o 
bias in results. 

v) ‘The sampling methods used in obtaining data may vary so widely that i is 
‘often difficult to compare the results obtained from the various samples 

(¥) Economic data are generally available at a highly aggregate level, Such highly 
aggregated data may not be helpful for individualistic study, 

Because of all of these and many other problems, the researchers should always 

keep in mind that the results of research are only as good as the quality of the data. 

‘Therefore, if in given situations researchers find that the results of the research are 

‘unsatisfuctory” the cause may not be that they used the wrong model but due to the 

poor quality of data. 


110, A Note on the Measurement Scales of Variables 


‘The varinbles that we generally use, can be measured in four types of scales : ratio 
Interval scale, ordinal scale and nominal scale. We can briefly describe them 


to Seale : For u variable X, taking two valucs say X, and X;, the ratio X\/X and 
(% ~ X}) are meaningful quantities. Also, there is a natural ordering 
ding or descending) of the values along the scale (say X, 2 X, or X3 < Xj). Most 
‘Variables belong to this category. Personal income, measured in rupees is a 
ble, someone earning % 50,000 is making twice as much as another person 
© 25000. 
‘scale : The interval scale satisfies the last two propertics (stated in ratio 
‘not the first 
‘example, the distance between two time periods, say (2018-2001) is meaningful 


y of the ratio scale (ie., natural ordering). Examples are grading systems (4, B, 
‘or income class (upper, middle, lower). For these variables the ordering 
the distances between the categores cannot be quantified, 


Scale : Variables in this category have more of the features of the ratio scale 


: Variables suchas gender (nal, female) and marital sats (mare nme 


» 
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The Simple Linear Regression Model 
————S 


2), Introduction 


“Most of economics is concerned with relations among variables. These relations 
‘when phrased in mathematical terms can predict the effect of one variable on another 
For example, assuming that income, prices of other commodities end all other 
determinants of demand are constants, we can express the quantity demanded (g) of 
any commodity as function of the price (p) of that commodity only, This may be put 
in the form q =f (p). Similarly we are familiar with other functions (with different 
assumptions) such ws consumption function : C= /(1), supply function : $= f(p), cost 
function C= (9), production function : Q= f(x.) where x, and x, are amounts of 
different inputs, etc. 
"These fimctional relationships define the dependence of the dependent variable 
the independent variable () in the specifi form. The factional elation may be 
logarithmic, exponential or hyperboli 
between two variables X and Y expressed as Y =f (X) is ssid to be 
tc or non-stochastic (non-random) if for each value of the independent 
there is one and only one corresponding value of the dependent variable 
he other hand, u relation between X and ¥ is said to be stochastic if for a 
‘of X there is a whole probability distribution of values of ¥. In such 
piven value of, the dependent variable ¥ assumes some specific value 
‘some probability. 
‘linear demand function (in deterministic form) can be written ws : q 
fip (a> 0, <0) and in particular g = 100 - Sp. When p = 10, q = 50, 


‘behaviour of the above relationship breaks down when the ceteris 
Temaining the same) condition is relaxed. 


tion sbout the whole. This sggregste or totality of statistical 
‘character of all the members covered by an investigation is 
‘When the population size is very large, it may not be 


W 
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as fect» sal pr 
pouible w uke a compete enumeration ofthe pupuation THN SE i, 
St the population called sample and examining to make fee 


‘ature ofthe whole population. The basic oct 
onl pat of tween {0 OF Ter 


‘model We assum 
rade We ass 
arate analy 


‘We know that regression analysis is ASI) Sone t endent variable (Y) 0 
nto purenter, ay roc vale of 
ie Bike yetical community and 


he fact we consider a total population of 60 families in & * ture (¥), both im rupecs 
{hse mentlyfacome (1) and mow consamon Pen is 


eee a eon en te bowing le ale 21) 


amily i he varios oe 
ae anre i stolatauarbuion of monty ome 0 €) and monthly consumption 
expenditure (¥ in €) of 60 families in = community 


15000 
15200 
17500 
7800 
18000 
18500 
19100 


D600 121100 


7709 1900 10100 11300 12500 13700 1490016100 17300 


fxed values of X and the corresponding Y valves against each ot 
a aren or From Table 2.1 we see that 

variation in monthly consumption expenditure in cach income 
‘group but the general picture is that despite the variability of monthly consumption 
‘expenditure within each income bracket, on an average, monthly consunpl 
‘expenditure increases us income increases. To understand it clearly we have given th 
‘mean, or average montly consumption expenditure corresponding to each of the 


Jevels of income. Thus, coresponding to the monthly income level of E800), the men, 
"76500 and s0 on. In total we have 10 mean values for |’ 


of Fs pep cond 
Paty vonage called expected values »s 
they Taal vel iene st GGchtoneg) vais a 


; 
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13 
Symbolically, we denote them as (Y2¥) which simply means the expected value of 
Y given, the valie of X. It should be noted tht these expected values ay called 
conditional expected values. In order 1 calculate conditional expected values of ¥ we 
have to constrict conditional probability distribution of ¥, PUY/X), shown in Table 22 


‘Table 2.2: Conditional Probabilities Y/N) for the data of Table 2.1. 


000 10000 12000 14000 em) 18000 20000 22000 24000 2ec00 


SS ae a er ee 
SS il a ee a ee 
thal 3 5 toe aay pe oe a 
“te sicid ER Rie fea ale Sa 
Pett Pk a 
O08 Oe a i a a a 
ee) a. BS Ga ae 
Oe beet i i a 
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SRO Ae Shi egtsie ot! SF 
Sy leh wel) od t 
6 Tete ot. ot 

1 1 


of Y| 6500700 _ 80 10100 11300 12500 13700 14900 600 17300 


‘consumption expendinures forall the 6 families in the population and divide 
by 0, we get the value £12120 (727200160) which isthe unconditional 
or expected value of ¥, £(2), 


the expected monthly consumption expenditure of «family would be 12120 

mean). But if we like to know the expected value of monthly 

‘expenditure ofa family whose monthly income is say €12000, then we get 
(the conditional mean). 

y, if we join these conditional mean values, we obtain the population 

(PR) or population regression curve or simply itis the regression of 


Tepression curve is simply the locus ofthe conditional means of the 
orth fixed values of the explanatory variable(s). More specifically, 


» 
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eorrosponding (0 
is the curve connecting the means of the wub-popolations of ested he 


‘values of the regresnor JC This (shown 4 Figure 2 


AVX) 


a. 24. Population rpemion ie (ata of Table 2.1) 

‘This figure shows that for each X (Le, iaceme level) there ix a population of ¥ 
vals (menthly consumption expendi) that are spread around the (eoitions) 
mean of those Y values, For simplicity we have assumed that these Y values arc 
distributed symmetrically around heir respective (conditional) mean values and tho 
‘regression line (or curve) passes through these (conditional) mean values. 

(2.1.2. Population Regression Function (PRF) 

‘From the ubove explanation and Figure 2.1 itis clear that each conditional mean 
LY2%) is a function of X, whete X, isa given value of X. Symbilicaly, B(YIN,) =/(X,) 
‘where f(t) denotes some Function ofthe explanatory variable X. 1 is linear function 
‘in X), This function is also known as the conditional expectation function (CEF) or 
Population regression function (PRF) or population regression (PR). It states mercls 
that the expected value of the distribution of ¥ given X; is functionally related 1 .\, 
In simple terms, it tells how the mean or average response of Y varies with X, However 
‘the functional form PRP is an empirical question. For example, to verify th: 
‘consumption income ‘we generally assume a linear relation. We may assuine tha: 

eB function of X, say, ofthe type 

@ and fi are unknown but fixed parameters known ay th: 
‘and B are also known as intercept and slope coeificicnt. 
ion analysis our interest ix in estimating the PRE and the 
id 00 the basis of observations on Y and xX. 
stated in Table 2.1 we see that, given the income level «+ 
amen the averay 

-aroand its conditional expectation. Therefore, 
ia eee the (fan individual ¥; around its expected valve as follows 

3B (WAR) 0 Y= BUR) +, ¥,~ a= BY, + 
is an unobservable random variable taking positive or negativ: 

values. as the Stochastic disturbance term or Stochastic ‘error 
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2.1.3. The Sample Regression Function (SRF) 

In discussing population regrewsion we have deliberately voided sampling 
‘considerations (Data of Table 2.1 represent population. not sample). In mort of the 
‘anor we entimate the PR on the basis of the sample information taken from the 
‘popolation. Por example, we avsume thatthe population of Table 21 i not Knwrwn to 
‘uc and the only information we have isa randomly selected sample of ¥ valves forthe 
fixed 4°4 a shown in Table 2.3, Unlike Table 2.1, we have one T corresponding 1 the 
siven 4s 5 each (given 2X) im Table 23 is chosen randomly for similar Y's 
corresponding 10 the same X, trom the population of Table 2 | 

‘Table 2.3 Tobie 24 
‘A random sample trom the ‘A random sample from the 
population of Table 2.3 population of Table 2.4 
x 


ed 


700 
60 
9000 
9500 

11000 

11500 

12000 

14000 

15800 

13000, 


_ Now from the sample of Table 2.3 we can predictor forecast the average monthly 
tion expenditure ¥ in the population as a whole corresponding to the chosen 


‘Population regression funetion we know thatthe fmction is of the 
4 = i, the sample counterpart of this equation may be writen as 
LX, where 


Of ELYN)), d= estimator of a and j= estimator off, 


» 
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11 should be noted that an estate, also known as w (sample) nari 
4 rule of formula or method that felix how No estimate the population paraincte 
the information prided by the sample at hand. A particular numerical yatuc 
boy the estimators known as an estimate. It should be noted that an esti 


random but an estimate 1s no random. 


Monitty coneurmtion expendi yn 7 
SERGTTEEEEUEE 


a eee 
‘Men ncnein® 
e-22. Ragemioa Hoes bse co to diet sumpls 

Lik the population reprenion fencon (Y; = ELIA) = uj = 0 = BX, + 1) ve vs 
expres the sample regression equation Whi ew nice tas fotos 

Todi BX, +i where 4, denotes the (sample) residual term. Conceptually i, 
‘analogous to wand can be regarded as an estimate of u, It is. introdced in the 51 
for the sanve reasoas as u, was introduced in the PRF. 

Now our primary objective in regression analysis isto estimate the PH. 
(9X, = uo the buss ofthe SRF, ¥, = =fiX, +. It should be noted that the « 
‘of the PRF based on the SRF is at best an approximate one (at sampling fluctua!» 

‘We have to develop procedures that el us how ta constrict the Sf 10 


"a fihflly as possible, 
Po mated Linear Regression Model 
suggested by economic theory are usally spccifisd 9s cv: 
oa eee, 
placed on the need for testing theories. This implies 4 belie in 
existece Senha Ta ‘of econometrics tries to ts! 1h 
in terms of stochastic variables. The simplest form of (1 


‘theoretical propositions 

‘relation between two variables X and Y is called a simple linear regression 21 
‘is given by ¥,= + BY, + u, for F= 1,2, ., mwhore Y= dependent vant 
explanatory variable (indcpendeot variable), u ™ stochastic disturbance term. 
‘are two regression parameters whose Values are to be determined on the bas! 


a 


‘ 
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given date on X'and ¥, The subseript "refers tothe / th observation, m = vample size 
‘0c number of data points 

‘The stochastic nature ofthe regression model implies that fr every value of there 
‘xn whole probability distribution of valves of ¥. In other words, the valve of ¥ can 
‘ever be predicted exactly. This uncertainty concerning the value of ¥ ares becauye 
ofthe presence of the stochastic term °u’ which imparts randomness to Y 

2.24. Role of Random disturbance term in Econometric Made! 

We may ask why should we add an error trm or random disturbance term un an 
econometri model ? 

‘The disturbance term w, is a surrogate for all those variables that are cmited from 
the model but that collectively afect Y. We may give the following reasons forthe 
inerton of the disturbance term in an econometric model 

(i) Omission of variables from the function : Suppose, inthe moslel Yu + As 
1% the variable ¥ denotes the consumption expenditure and X denotes disposable 
Income, But in reality isnot the only variable influencing Y. The family sz, tates 
of the family, spending habits and soon affect the vanable Y. The error ‘xi 8 catch 
al forthe effects of all these variables, some of which may not even be quantifiable, 
and some of which may not even be ientifinble. Therefore uy may be used as 
substtte for all the excluded or omitted variables from the mode 

(Gi) Unpredictabe element of rondomness in human responses : For instance. Y 
"= consumption expenditure ofa household and X ~ disposable income ofthe household, 
there isan unpredictable element of randomness in each household’ consumption. The 
hhnsehold docs not behave lke # machine. In one month the people inthe housebold 
te 00 # spending spree. In other month they ar tightfited. 

i) Imperfect speciation ofthe mathematical form ofthe model : We may have 
a etiam baton Xt Y or we may have left out 

‘model some equations. 
tis because the economic phenomena are much more complex than a single 
muy reveal. Fr example, price determines and is determined by the quanity 
oe quantity demanded) inthe mackst. Under such crcurstances if we atlerpt 
twatudy the phenimena wih a single equation mod, we are hound to commit an ero. 
the diubance term represents soch an err vihich may be due to imperfect 
‘of the term ofthe mod, that i, of the number of equations 
Gv) Core varlobls versus peripheral variables: In consumption income relation for 
‘we may observe that besides income 2, the mumber of chikren per family 
sex Xy, religion X,, education X, and geographical region ste. also can affect, 
expenditure. But its quite possible hat the joint influence ofall or some 
variables may beso smal that asa practical maticr it doesnot 
into the model explicily. However, their combined effect can he treuted as 
variable u, 


‘as possible. If we can explain the behaviour of ¥ “substantially” 
‘explanatory variables snd if our theory is not strong enough to suggest what 
might be incladed, why introduce more variables ? In such cases u 
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Teprescot ll cer varablen OF coure, we show motel relevant and 


in measurement (Errors in variables) : Another jurtificniior 

‘te iseton of « drutance termi Ua represen the reesurement erors 

‘rewarding or processing of the data on X and Y Thus the disturbance term reflects 
“errors in the observations, 

“For al these reasons, the stochastic diturbances , assume an extremely «1s 
‘role in regromion analysis. 

23. Classical Linear Regression Model and its Assumptions 

_ Let ws consider an observed relation between two variables X and ¥ which ¥» (60 

by t= a+ By 12,0 m where ¥ ty the dependent variable, 
the disturbance term, the subscript “i denotes the Wien» 
ohh - eee values are 10 be estinated 


follows normal distribution 
=: This menos that the probability 


shown a follows : Since Y, 
+ BELA) + Blu) c+ JX, as Lu) 


ys a ty ae identically 


(0) 


{THE SIMPLE LINKAR REGRESSION MODEL, 
‘Asnumetion 41 The different eter Yerms we independently distributed ie, Fy) 


[Naw Civ (ip 4) = 0 for Oe) 
send Cov (up) 8 For mf where b= 1, 2. m 


5: The independent variable X \s non-stochastic oF non-random 
als ieavenn 


9 


eet fr te ep 
to the equation Y= a. {X; * mp ¥, bs linear funetion of w, Since u 
Cine sain Fa Bw Mea Seon vo nereaty 
i) N= a+ BX +m 
BY) = Blas (Ky +m) {Flay =a, Eu) = 05 


eeeres 
“This means tha the mean of ¥, is a AX, 


Gi) Yur) = ED, FP = £1 BODP = Bta+ Bx, ou Cee BX, HP 
, = E(u? =03[+ Bu, = 03 | 


‘we say that variance of Y, is 2. 


= N(a+X),03) when w,~ N(,o3)- This sisted in Fig 23. 
Tine. This regression line is unknown 


4! imply 


counterpars 
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Since y= 0 + BY; ¢ 4 foe (= 1, 2... 9 the regresion equation. 

‘The sample counterpart of, he etimated ero (mbich la called he rsh) 
defined ae 8 = 1, -8.-A,- 

The two equations to determine and fare obtained by replacing population 
assumptions by thet sample counterparts 


Population Assumpilon Sample Counterport 


a 


BU) 


ev axn=0 
‘Thus we get the two equations 
Sano. Bor -fN)=0 
and Sia o0 08 Sx.0, 
Si fi 
‘hse ios can te wien a 
Bide a 
Lana Da? 
= 


and). The etinated elation becomes f= +t and e~ fhe residual erm 
“which shows the difference between the observed and estimated value 
“The method of least squares consists in finding out those values of and fj for 


‘vec GW enone Tis woes tat we have to miniaine Soot = Zr, -H? 


ae 
e722 -6-it)-0 =) 


ALAC -6-f=0 
ct 
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| Asin frm equation (3) We 2 


bE 


||} we Shea iS AL oe Feaelit ae F-AP 
| “ 


papa regression equation J, 
‘OLS method we have estimated the linest a 
Er oS meen epi of LRM. THe SST! " 


equation becomes, adit; wine & and fobs 1S enim of 2 


Te wo put 5 =) - and yn XT then we have 
Son 

bE wd = Pit 
xa 


‘oF Yon 4: In this case the regression 


me Ugreion equation (of Yon X) where isthe 
‘Sometimes we have to consider 
reeression, 


TS 
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‘he einai nation Beem fy 04° f° and anf whch oe he 
esa term, showing the dflerence between the shure ind stint ato Tre 
method of last squares consists in ding owt tone vues of i andj for which 
Se sini 1 mes hat we re nie 

be ‘th he eno a The 
ecemary comtons of minimisation rye 


2 Reena. =) 


2 geet —@ 


equations (1) and (2) we pet two wormal cquatins 


FR 


a8 Foo 


the equator: 


a 
a se a 
joi tele fee “ce 
0 Sane sent reer lb 
<7) | ket 
og tar ar Mea 4 
eg ee er 
(Sige ae ato 
eS ee ee 
ora ene 
o » 2 4 on + 016 
Total uo, % ty, te? wy? 
oom 4% Hy SE AR 


Fegresion equation of Y on X (direct repression) is given by, 
on 7364075." 


0.690 = 96-1376 
reverse eyression eqeation (X on 1) is given by f= 4° (ry 


the estat rerio coefficient of oa Y 
= 

11600 ~ 0.5175 = 0.52 and rey * JOSIE ~ 0.7193 ~ 0.72 
and Units of Measurement 


‘amalysis the units in which the regressand or the dependent 
Fegressr(s) ace measured make difference in the regression results. 


| re 
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‘sitet, applying OLS mete to equation (4) on 
ae Tek (10) 
+e An ster 
vw Ge ot Tr) 
2 
weine S oy 


e-Be _uo 


‘Thus we se that from mode (1) and jj are the OLS estimators of «aod (and 


(2) f* und (\* are the OLS estimators of * anu. From the above remus 
i is ensy 0 establish relationship between vo sete of arate, 

‘Slnce 17 = Wi (OF 9} ay) XY = WX, Ce x) m5); uf = my: Fr wih 
and Rea, we can catty verity that 


a (Be con lS) 
ama (16) 
af =wjed C7 
Me Gym? Ver @) ay 


‘we can derive the results based on another scale of measurement once 
facies are kown. From the results given in (15) wo (20) we can also derive 


by the fctor W; the slope as well as the intercept coefficients and their 
ferors ae all mukiplied by the same W; factor. Finally, ifthe Y sale 


~ 
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remains unchanged (i, = 1 bat the X acale is changed by he faa abe str 


weft nd ur un rte maid (7) It te iter 


en) 


unt) (aoa) = 08 
‘ayation of GDS on GDF (both GDS and GIP 


ags\-036G0F, 022) 
177210078) (002) 7? ~ RRL 
eth he expt amis standard noi 100 mes the corte, 
fon (21) [we shalt noe that Y= 100 polng fromm ere toa 
Ferre ~ 100 ahs, at esl ein a wel sts tana cy 
jn woconance wit the theory: 
we manure GDS in rypocs crore and GP in rupees lakh, the estimate 


its value 


cal is changed. 
GDP in eupees crore, the estimated regress 


wich havea pero interes" 
ong ran cons 


consumption expendi. 


‘THE SIMPLE LINEAR REGRESSION MODEL. a 
‘In this event we should estimate the function Y= a+ AUX = u, imposing the 


Lan, 
reatrction a= 0, The formula for the estimation of then becomes ji = — hich 
rae 
a 
‘involves the actual values of the variables, and not their deviations, as in the case of 
‘unrestricted vale of a. 
Proof: We want to fit the line Y,~ a ~ AX, + ap subject to the restriction w= 0. 
‘Tocestimate [the problem is put in a form of restricted minimization problem and then 


‘muliplir. Now we have to minimize “L’ with respect to, fi and 
‘of minimization require + 
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(©) Estimate the elasticity of demand atthe price 4 
(@) Forecast the level of demand if price rises to 5, Comment on your forecast 
‘Solution : (a) Let ¥, = a + BX, for i= 1, 2, 6, be the linear demand function. Hy 

the OLS method we can yet the estimators of a and f. Here Y ~ demand, X= pice, 

% flare two paramaters. Theoretically we may assume a >, fs <0. ‘By OLS method 
mu 

De MH eK, yan -F nd ao 7 fz, FeLDX In 


F-Dyim 


euatons fr the parameters (a, 
Yer memtty mie Tey AAR ay, 
(00%) 


estimated demand function is 7 =a-jix or £=¥= 107 - 19x. 
with the theory where we assume a> 0 aod (<0. Thi 
exists an inverse relation between price and demand (ic, the 


and G = -[W = 63 ~3.25 « 9 = 63 ~ 29.25 33,75, 
“Thus the estimated supply function 
Here we see that 


Poashx on Fe 3375 -42sy 


= 3295 > 9 and §=3.25 > 0. This menns tht there is 8 dret 
(postive) relation between supply and pice. The intercept of the supply function ix 
theory 


Dustve here. Hence out reo tre onsite! withthe 
(1) Average rice elasticity of suppy i en by, ny =i 3.5 + 2p 4sct 


‘Thik shows that atthe average pice the supply is prce-inclastc 
4) We have to find price elsicty of supply at pie 

‘Since the etimated supply incon ie 
Fedex ov P=3375+320" 

“Now it~ 6, ¥= 51.75 ~325 «9 =3975 ~195~35, 


_ Now by definition price elasticity of supply, hy 
“Tha p= 0366 when = 6, 
Fm the estimated spol Faction we ac tht Y= 33.75 3.26% 
X= 6, Y= $3.25 
ric increases fo Wie if = 8 
Y=3375=325 0 4= 5575 + 26~ 99.5. 
means that when X= 6, = 53.28 
X= 8 Y= 8978 


aig 1325-0366 


Possess three properties ; They are linear, unbiased and have 
Yarince (compared to other linar unbiased estimators). Thus the OLS. 


jemimates @ and jj are lincar functions of the observed sample 


ae 
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2 The property of unbiasediness 
‘The means of & [24 N} and fi (21) can be obtained as follows 


un pet ERCP Lei -7 
ie ss 


du HH -DISA-D Say, 
=H win xe] -#)=0 
Eun $a 
% 
and x) =X)~% for i= 1,2, 0 


ig vere a wos 8-5 [tay 


Ei We now a Y=a+BX,-u 


(e+Bti+m)= aK, BKx, Ska, 
fa genie 
ae 
Be ee = By=0 
i iar 
(4) where 5X -F oxen F 


be 


eo = oe 
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‘swe BBE a, ee prey 2 


a aan en 
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ox eagle tet we same p* = EG whee = +d ian hy et 
wg sie Qt ot he se) to he 
Tce Bt)* tm te exronien of Pond we chia 
peoaosntien= Fac oe 6m 
tis asrumed that like fj, Bis also an unbiased estimator of Bie, ID) = 
Now £"=2| Siocon ec] 
sao | sconSoxs Fen 
om A 

“Now £(9*) = B ify and only if 

Seen Lease ma Eon-0 

ee a = 


Ry ee Sa, 
me mt 


ayetn rd Bee 0 (wSa-0] 
A= EK Dd, = $2" \nza-9) 


rt 


4X; =1 eure 34.4, #0, 
a 


ox =k +d) = EAM DAN, 
2 pr 


Fea -LTGe=t if Dax=0 
| iar} Gl 
te lincar unbiased exit off (with weights C, = K, + d) if 


Fo-0, Sa-0 Eqxet oa Zati-o 


_ « 


==. +~— 
quip SIMPLE LINEAR REGRESSION MODES, re 


since 234 = 0, it proves that Vor) > VartBy ot, Vari) « VariBey 
[1 


“rhus itis proved that fi is the BLUE of B. 
‘case {b) Inthe same way i can be proved thatthe least squares constant intercept 


4: powesses minimum variance, We ake x ew estimator a which we sume wo be 
eRe funeton of the Y/s with weights C, = K, + dy 


‘Siar, wraS(t-ne)y =) 
“This shows that like G., a* is also a linear function in Y, 


‘a? is to be regarded as an unbiased estimator of . 
for ¥,~ a= BX, +m, im at and we get, 


ee a | 
y co m4) -|2-ml Eon) + S(t-2e)] | 


vm a itd only if 5G. -0. SGX =! and Fcu-0 
ae a 


a imply Soa, =0 and FA.x, =O. 
= 4 


ich - @$[ babar] 


ie)-aiondeasts 


==> —- 
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‘Since = ¥~H 
= PVF) = (+BY mR) (B+ fin, 
= pox, Boy OH — 
B= Bh, + mH) where 4 =X) H 
06 f= GBP P+ 4-H -2548- Bk) 
= GBP +04 0-28 -PraH 9H) 
ofa-d-9' Ee -Le-0F-24-0{ Snes) 
a 


i] 5, s2 (i) 
te ® Pa | 
ape — : j-p-Sinn//Se 
aaa Tt 
|= 

4-2 = Eada de 
26h Sm Seat ont 


feo) -ge BT 


Pa 
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seit 
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{TW SIMPLE. LINFAR REGRESSION 
rag tel eid eto of es | 


Fomanseated = 3 0) 
on, Heaton) fl) Manel) xb at 
alee | 


“Taking Log om both sides we get, 


oud =—Blor2x-nigo, <b Fd 


that a is the MLE of a and (* is the MLE of f Theo, 


F pee [semb-Son [ee maar 


ee 


20, Set /n=o*, (oy) is the MLE of the variance of the disturbance term, 
ro 


noted by of. 

[ote : For maximisation, however we require second order conditions But this» 

not shown here. Hut we should also check the second order conditions, requred for 
This Cage, 

inn Hm wee tow a A, Ey 

8 

Sotto 


‘Thus we see that the MLE of © i.e, o2* = -La/ {is pot an unbiased estimator 
‘uti v8 consistent estimator 


ee lnldeho}2] 
lines] =e 2} 


/ab-2) 


ASe fot ware, 


atthe MLE of of ic. $<? /n isa consent estimator of of 
a 


OF and B is., a* and fare unbiased estimators of cand (but MLE 


Yi Gand f) must also be normally distributed withthe following 
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their te v4, 
10) and fare bis estimate, these means being equal 
of sad 
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ston anh Venn and « or 2-850) 


vm ne e] 


1 the same way, for feting we have, 


for of: 
pe normality assumption, the variable Boone follows a 2 


df= (n-2). 
ye can use x2. Wo establish a confidence interval foro}. 


a: eeTROLETION TO EONONE 


ri Ho-ngiaasengt}oe st 


ee 


—— Ow 


nypnheis My © when Ht 4 Arve Traditionally, we Fx dhe valve of oa the maxim 
‘lowable vale for the probability of Type error and then proceed to tet the mull 
Typhi terme ofthe appropri fut write (, 11? ete). The pre specified valve 
‘a (umunlly @ = 0.01, oF 0.05 and 0.10 in some eases) sets corresponding ta olranee 
Time, the ete valves) for the observed feat static Ifthe ealeulted valve of 
‘the tat saint exceeds the tolerance lis) set forth by the pre-spreified a then, 
erected and other wise accepted. Smaller the value of. the more Hera the tolerance 
Ait for the obwerved test static gets 

"Now we can ask # natural question : what i dhe smallest value of (sngificance 
Aevel) at which the nll hypothe, gets rejected? The answer is p-valve (or probability 
valve) sociated with the observed data vet 

“Once the p-value is computed, ope ean make use of 10 came 19 «conclusion ry 
‘comparing the p-valve with a. 

‘Usually, the p-value <a then reject M7, ; otherwise accept My 

‘Toiltutrate it, we consider an example where we have ited the repression eyuation 
Y,= + BX, © mp showing the impact of education on wages, given sample we of 
om 13 where Y= wages, X= Education years of schooling (ihe original data are not 
‘given ere) 
The estimated regression results are given below : 


“Hy +" 05. The appropriate test statistic under 11, = p= 0 wosld be 
“,y-BB 070-05 

ip. 32 
tea seq) 00700 

(observed) = 32 
‘of the given sample Hy : f= 0.5 will be rejected at 100 a% level 
when (Hf, : = 05) i | ty (observed) > Me-2 (Table) 
13, Kf = 005, 69-2" fsassi= 2201 


£= 0.01, Ybn-2= fagass” 3-106 

= 05 jis rejected both at 59% and 1% levels of signifiane as |, 
= 3.2>2.201 and 3.106, 

bass of p-value the mull hypotesis wil be rejected forthe given sample 
ill be accepted otherwise. 

the null hypothesis, that the true coefficient of education (f= 0.5), we 
of 3.2. Now what isthe p-value of obtaining a¢ value of as mach as 


given in Appendix Table-IV we see that for 11 df the probability 


*P yalue must be smaller than 0,005 (one tail) or 0,010 (two tal). 


th the Pn, 
32 }= 0.00001 »,, 
Mere «= 


<> a". aaa 
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we Dob Et Se 

new Sion Beevdt Pr 

<thasidrn rhs -ofede stmt S10 4) 

io So,-1 -Beasin ry 

= Se-inbs rT (ae F-ARy 

= Sli] - Eu, -#F 

SEM E-0F hc-rm Se 


PE a,-¥F +3¢ 
a fa 


; Pia Se where 9 =,~Fand 5, 0X; 
of | _ [ Explained Residual 
ial pie Rape cu] 
ESS + RSS. 
P represents the total sum of squared deviations trom F, which we may 


‘the total variations in Y. 
‘can be decomposed into two pars 


ie ]e6 Be esi eo si Xo he 


‘sum of squares). 


ea a 
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Pool Pid oll 


tv $7 aby 


ml i 


y 
a) 
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Since O< RS 1 

osPst 

on -lsrsth. 

Example 2.4, Find the value of F from the following information and comment 


1 = 3347.60, zi. 604.80, Di- 19837, n= 20, where x, = X,—¥ and 
A fs 


poe 
Now 2-H — ~ 306860480 18561312 4935 


i 


a R=0938. 

Suggests that 93,5 percent of the changes in the sample observations of ¥ can 
muted to the variations of the fitted value of Y ic, 
jon line fits the given data well. 

measures the proportion of variations in the dependent variable that is 

‘the independent variables. 

2.5. A sample of 20 observations corresponding to the regression model 

4, where u, is normally distributed with mean zero and unknown variance 


of we say that our 


following dats = 


—- | s 
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1 
‘Aesin we know tht, Le-Ei-vie 


[ome BAPE AE ste Sf B0- 7 a Ea-fu, #7 
EF 169-0400 -2154-469- 2356-3434 


now of =) = HM og 


=v 
“Now seta = 83 a ABM ag, 


SY) - 47 =2154 of, $x} nF? = 2154 
co 


+ 1733.52 ~ 1948.92) 
svar ()~ 08631 


(9) 0.025 and (b) 0,005 
sp both sides. From table value: gs, 


8+ foms (n ~ 2) SEG) ie, 
ol —2) SE(A) 
Mra be: 8 os. (n—2) SECA). 


—— 


{7B SIMPLE LINEAR RRORESSION Mots, 63 
Here == 3508, = 044, SEA) = 0.929, 515) - 0104 


Ta eager tet vrais i 60-48 poet of te seperti of ¥ oat | 


3 Yq is disposable income, the mumbers in parentheses are 


significance of Y, statistically using tration. 
the estimated standard derivations 


of Marginal propensity 
in the existing theory (2 > 0, 0 <b < 1}. The estimated 


n=19 
=0.99 
ws that @=15, 6-081, 


é 
3.1, (eatio) and SG ~ 18.7 (ratio) 


Explained variation in C_ 
Total variation ~~~ var(C) 


oe ee 


rw SIMPLE LINEAR REGREREION MODEL ae 
‘Tis the existed mendord devietions of he parsmetereatmators wre SF) = 
‘eyby = 0080. 
Yo) We hive f0 content 95 fer Geet confidence interval fr the coetent of F 
Nw be sven by 
Batons 2886) soce 
4, O81 82.110 * 0.0433 feortes 
oat soon Seat - 210 
Le, between 0.7187 and 0.9013 St) «0.005 
‘This means that the coeMichent of Y,, boon 


‘wil ie between 0.7187 and 0.9013. 


Sans orsne Ste firey Oo Lsbe.tow of wach sd oops 
arb w7M09S te 679 
i eu60 79 ou» 

els a hoon reed ot wT 20 6 ri6 NO. of). find 
me © the OLS estimators of a and (i.e. Gand fi) and the estimated repression, 
Hine Pdf. e 
@ Find verti), var, 5546) and Seip) 


Find eva ot Ee and £4 fo n 
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Thos m 
{atcuations forthe Regression 


B=M-¥ yah? fay 


Ea Ban bt 


=m =21 = 
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ne af ait sat i = 0 
ee 


(iy wee ind ut tae of Ee? at ft [en » 


Here Sef = (01)? (1157 + (09)? 65) (1.19% = 14.65 fe van ae 
= 
taken from last column of calculation table) 


sat 82S fin f$85 MBE ora 


{vy We have to find out the value of the coefficient of determination, R? 


tht 
[since $97 = $7 
7 En PEGE 


ie, TSS = pants 


Since we know that A? = 


SDP Late 


70,75 X28 + 14.65 ~ 15.75 + 14.65 ~ 30.40 


nlf 


= 0.75)? x28 


‘that 51 percent of the variations in the sample observations of Y can 
the variations of the fitted value of Yic., J. Here we see that our 


line fits the given data moderately (not very well). 
‘above results we can write our regression results as follows = 


3.6 + 0.75.X, R= 0.51 
2.090) (0.256) SE values in brackets} 


5 SEG) S0<8+ ear. 2° SE(G)] = 0.95 


—— 
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= 73=1.33, -08 
He tos Bb ae” aon 


2.343 and | ty 9 |= 243 
From the thle value We Bt fous. «2 * fomnss = 2306 


[2340 > togagy = 2306. This means tat om the bas of sample dt, 
fibrin Rho of ome - 


aE R hte 2, = 135 whe ere suet he sre 
p> 18 a 1000% level of significance if for the given sample 
See 


‘and will be nccepted otherwise, Here a = 0.05, n= 10 


aad f, 9 (observed) = ri 2M3 < iggy © 1.800. 
‘So, Hy: B= 1.35 is accepted (hence in significant) at $% level of significance 


The mull potesis, My = 1.35 wil e rejected agnins he alternative 1-39 
“petbe gen spl at 10 evel spine , 


ha (meer) ~ ‘sp 
1.860) Here a = 0.05, n= 10 
head B= 1.35 is rejected(significant) at 


BO abhegeeeetee ene ae toe 
odel is the analysis of varlance. This isthe breakdown ofthe total sum of 
‘nto explained sum of squares (ESS) and the residual sum of squares 

ose of presenting the table isto test the significance of the explained 
In this case this amounts to testing the significance of B. 


= Explained variation ~ Unexplained variation or Residual variance 
K=1) + (n~ X) where 


70 A IETHVETION TO HCOWON 
Ube rel nw rwo varable egremann del he 

Hn defin, wot na tht 4m 
Nowqak-h 2 


sa Oe ‘arable fe agifieat factor ofthe vation nh, pero 
Pn by computing Frat crovgh Aeiing mean vom of squares of explained 
1 that of residual In testing My B= O apsinat the alternative 1 B #0 
‘We emay use he tent sata, 
2 
ap (K=1) 
elit Sieg . 
eDe0 | 
= ey MUNA Lo (n-2), Mow K = 2) 
we have 10 compare F*.3 with the wie value of F with A= 1,n— 210 


nin fund that F* > Fy (Ale), we reject the mul hypothesis a 100u% lve of 
‘865 and accept otherwise. The relation wil be significant if 


ea atin 2S 
ete BEES - rae et 
wb Yt-Sd | De 
S St a 


|. 
(que SIMPLE LINEAR REGRESSION MODEL, n | 
‘To \eet the mull hypothesis concerning the significance of the regression (ie, 


IP oe, Sed -0-PEy] 


Gn [PZ] 
ees Do | ae 
ora re 


RK-) 
generation becomes Ty /cq_—g) We we have K parameter, 


ge Ldln- 
ae iF 
Lein-2 


Bu vat = 


+ of = 


(es) 


THE da 
ANOVA TABLE 
st] Sum of Degrees of [Mean sum Fr 
Sersuon | squares | ffeedom — | of squares | Cinarved | ante 
‘variation vale 
1% and 5% 
Foownan 
~429 
with dF TY Fost 


a4al 


=27.55 is much lacger tan Uble yy, 3 18 

{1,18 441. This means that Hy =~ 0s rejected both $8 and 
WW levels of spifcace. Hence, we rect the cull hypothesis and suept that the 
rgeton ent pane Se non 


‘we may have to estimate a regression equation separately for several 
data and we may have to test whether sme ofall whe parameters ace the same 
sets of data, 


Heath 


ashe 

four problem is 0 examine whether these two catmated relations difer 
Tyee, then we may conciode tat the flatosip changes fom ce 

the oer 


suppose that we have the data on consumption and disposable income 
1990-1999 and 2000-2019. We estimate the consumption functions 


€ these periods. Then we may be interested to examine whether the 
-Sttistically significant or whether the MPCs signiticanly differ or not. 


‘cent level « 
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‘ohatton : Suppose the estimated consumption function for the first sample 
Gey hr ad for the second sample C «dy «fir 
enn oemmotin Racea C0047 ‘We have to tow 
pe hopes 


Ho edy mth By against the alternative Hf, © My is not true 
MPC of the first sample 
MPC of the second sample 


(est statistic will be 


_(E4-(84- Dalle 
” Saya nem 26) 


+m~ 24) 
be rejected (significant) if F* > Fy :K, (my * n~ 2K) and will be 


ene, LG-G) = Le -3.281 
LeF = RDF = (10.92) » 3,251 = 260.08 


Nem =30 fy =082, P= A= 098 
PaEdaasae-2 


OOE+266/2 ae ik m+ 
BFGS+30—2H2) with Af = (K, my + my — 2K) 


85)/2 _ 7916.66 _ 999 97 
79783 


Bae 
eto 


ee —— > 
LINEAR REGRESSION Mop, n 


0 estimate the parameters of the tom Fnear fon 

ines son intone frm snd then we have myn rrr ome 
18 method 

wh onder to find out the tle of change (ie the 

3 eset bet converted ints Uncar oem wie ret hod 


logY = a+ Blog. 
logY =a +Ax boy Buy 
Yearplogx (4) o(z}* 


(ay 


r=a+6(4) 


cates that the elasticity is variable, depending on the value taken by 
both. When no X and Y values are specified, in practice, very often these 
are measured at the mean values of these variables, namely, Y and 7. 
Estimate the investment function Jr) = a(r)Pu on the basis of the 


y =12.2771, es, =16.6729 
at 

Sx -15.1202, 

a 


ya? = 2.6891, 
ae 


—_— 


Fe log f= 30812 — 1.12965 log r 
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“The regression results can now be writen ax flows 


jee at sir’ 


070 
um 


<: (ey where a 
show that the constant interest elasticity is -1.2965. ‘This means that 


log of 3.0812 


te for investment is imterest elastic. This result is consitent withthe existing 


“(ually we do not differentiate between prediction and forecasting. We use one or 
‘iter interchangeably. But these two terms are not identical. Prediction simply 


‘an estimation of any event happening (in the past, present or future). On the 
hand, forecasting is always associated witha time dimension in the future he, 
on for some specific Future duration or over a period of time. All foreuasts are 
‘but not all predictions are forecasts, as when we use regression to explain 
p between two variables “forecast” implies time series and future while 
does not. When we are interested to predict o forecast about future, we call 
analysis as historical regression. With the help of regression analysis 
| forecast about the future value on the basis of past and present information 
‘variables (X and 7). In the context of forecasting we may also distinguish 
forecast and ex-post forecast, Ex-ante forecast is a forecast that uses 
available atthe time of forecast, whereas expost forecast is a forecast that 

tion beyond the time at which the forecast is made 
i define u classical linear regression model, given by ¥j = a+, +, for i= 
n with the help of the pairs of observations (X), ¥,),(X3, Ya) oon UY). We 
relationship by the method of least squares. The estimated relationship is 
‘In case of time series data we write the regression equation as 


value of X, (the independent variable), which is not in the sample, 

10 estimate the value of ¥ (the dependent variable). 
Of finding the value of the dependent variable from the estimated 
the known value of the independent variable, not in the sample, is called 


that 4 is the value of the independent variable, not in the sample 
the value of Y when 1’ i. There are two types of prediction: 
(ii) Interval predietion. 


INRAR REGRESSION MOTE. at 
siew coat fy [4-0-9] 
= at ARF +n -W + #[ 46-00) 
ee = Leary 


sy vot) = Fed) 
gf overtly + XG vary - 2498 vary 
som 8 + warty + 8 vay = 2505 wary 6 8? wareiy — 7 vary 


deat) + (9 = TY varthy 1" vary 


] 


a ; 
oF oat] 


} 


i) 03] b+ 
ia 


rt. 
i 


vant} + SEF? vu (ko — FY va) vy 


= Bie) = 03 [1+] (x9 ¥) vay 
‘var(f) all the terms are constant and positive 
is minimum when var(p) is minimum. 
that, var(f) is minimum when jis the OLS estimator of 8. Hence, vartiy) 
ywhen fy is the OLS point-predictor of Yo, This is the BLUE property of OLS 
‘It means that OLS point predictor of ¥, Le., fy is the dest linear 
Ue 
‘error is defined as. B(%)— Yo = ey. We know that, Yo =a + fig +My. 
a+ Xp because E(u) = 0 
4 -fiky 
(B-f)Xy = ~-a)-G-P)Xy 
i +B-BPXG +2X9(-anfi-B) 
E(G-a)? + X3E(6-By? + 2X E(G-aXih-p) 


Ho E0%)| = va) + 9 van +2Xy 0.) 


— a 
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| 
rw mene Lote ary 


ee coe 
ie 


stom means that iy 8 normally ditrbuted ith mean Fy and vaance 


af distribution with (n ~ 2) degrees of freedom. 
test : If the alternative hypothesis is Hy: Yo # 4, then the null 
the accepted at 5% level of significance if ~f mas. " ~ 2 Sf Siyoose 
I be rejected otherwise, 
nat is H, : Yo>A then Hy : Yo ~ 4 will be accepted at 5% 
‘ft (observed) ty 9s, 2 (able) and will be rejected otherwise 
hypothesis is H : Yo <A, then Hy : Yy™~ A will be accepted at 5% 
if 


| 2 ~ ings,  — 2 (table), and will be rejected otherwise. 
1 of the null hypothesis on the basis of the sample data implies the 


‘that the same procedure can be used for 1% level of significance, 


un si LINEAR REORESSION MOO = 
sirative hypothe sy PLT)™ Ahem iy BLY) A wi be secepted 
agi ee eine (aA S yy 2 2 (tbe) and ill be rejected 
-~— hypothesis LT) <A, then My B(7) = 4 wl scope 
ro sgniicne 1f¢(obwerved) ® tgp, ~ 2 (ible) and wil be rejected 


Fen The same test procedore is applicable at 1% level of significance 
be seen that 100 (1 ~ a)% confidence interval of ALY,) would be 


1) wher erally thes the vale 0.0 0 0.01 


12.42. Following example 2.5, 


10 find out the point predictor of ¥, when X; = 10, We know that the 


(where & =—3.505, f= 0.494, See Ex. 2.5) 


ave fo examine whether ¥,~ 165 when X;~ 10s justified or not. We have 
ul hypothesis Ho: Yo = 


RRORESION MODEL a7 
AS, FUT) 188 


—eeeanee 
Be fone B3 4M me 


10, Fao, SP arise 


486.581, 
basis of the yiven sample the mull hypothesis Hy : E(Y%) = 18S will be 
6 level of significance if ~Iuansn-2<ttyqrs-2 and will be rejected 


=Fnoasin=2.101. 
= 486.581 docs not ie in 


the interval -2.101 and 2.102 and hence 
aaa i be teietcd. $0, ELY4) = 155 when xy ~ 10 ia not justified 
He 2.12. Consider the following regression model Y, + BX, +, where u, 
isbuted with mean zero and variance G? (unknown) We bare it 


F not at 5% level of significance, 


j= 2802 and SE(3) = (War = VERE =1 674 
ratte] $8 =48-o1105 


11675 and sep) = ard) = JOTIOTS = 0386, 
for and ji: 


‘will be accepted at 5% level of significance if ~igqasy.3 5 
and will be rejected otherwise. 
that, 


90 
(0) Test for | If we tet the nt 


evations of ¥ ce 
jean say thar ow 


@ equation ofthe 


Ff ~46-12"3~46-36=1 
‘the estimated (predicted) sample repression equation is F =a +(x, ~ 


‘ean be obtained for different values of X, Since e, = ¥, — 


VL and-¥;=3, ¢=3-10-12*1=30-22-08 
2and Y= 4,6,=4~10-12<2=4-1-24=06 


—— 
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“ee prison 6 ail gen by 
Hebe 2X 16 that when X= 


<6, Hye 10612 «6 = 429) 


ia, Sz0eniiEpon) fT 


oe 82082112 1795 


see $20 £5.72 06, 248 and 13.92 
‘5 95% confidence interval for the average sales (£(f,)) corresponding to 


£600 would be © 2480 and ® 13920 

"should be oted that this confidence interval 1 narrower than the one we 
for Ys 

“trample 2.14. The following table (Table 26) gives data on the level of education 


‘of years of schooling), the mean hourly wages earned by the people at each 
of education and the number of people at the stated level of education 


RaceeyBsasaue 


a+ BX, u, [4 ~NO, 0D), 


nt) 

aie 

BDI\0m, 
orn 


“LINEAR REGRESSION MODE, 
sd. X, = 6, then from the abave relation we get 


pai te 


fF = 8.6747 -0:7240067 » 12 ~-o.01445 
144 and fp = 0.7240 
ive to calculate the values of var (4) and var (jb) 


BDA 
gt var (G)~ —— . Here 92 is unknown and hence itis replaced 


it 


asd eatizator 53 = Se [n-2)~ 08036 
= 


.0.8936n2054 1835444 _ 
es sa alge = 07757 


ne 
n-2)-080 


= 98935 9001910 vari) = 0.004910 


— 


que SIMPLE LINEAR REGRRASION MODI, 
“when X= p= 20, Yo To~ 
Igjeamtrenin 2 Yom 22 (Tbe rave a = 005, 4 = 18 


0144 + 0.7240 « 20 = 144656 


“pain X28 Fn. Ld -12. £4 fan -o8ne 
a es 
At = 9) = 9% (0685 cn ira oY wl 


fidence interval of hourly wage rate would be § 11.9788 and $ 16.9524 
ff education (years of schooling) is 20. 


= foqa5,1) = 2201 (Table value) 


jam me Sa -10, Sa | 
ee? 


|-2) = 0.8936 


— 


i a - | 


291.78 + O71 K, wm, when = ASO = 4, 


1,784 0.71 * 930" 698.26 $499 bon 
expenditure Us 2025 will be $435 bilon when the incre of the 

billion. 
Interval of the predicted communion expenstinrs of he country 


eps zaaOr 
hl 151,482. 


c interval of predicted consumption expenditure of the country 
would be $$83.5116 billion and $686,4884 billion. 


EXERCISE 
near regression model ¥,~ a+ f+ fr = 1,2, —.m why do we insert 
explain the assumptions of clasica ica erosion model (CLRM). 


near regression model ofthe form Y= c+ BX, + ay ¢~ 1, 2, —, 0 how 
the represion parameters and f!? 


estimate # linear fonction (two variable) whose imtercept is zero 

estimate the elasticities from an estimated regression line 

‘the properties of the least squares estimators relating to a two variable 
‘model (CLRM). 


mean and variance of @ aod jj relating to a model ¥;~ a BX; +m; 


A 
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[Elna «sand enlewnteetinaen of varies of yout tian, fatimate the 
‘conditional mean valor of ¥ voresprating fo valon uf Fined at X= 20 
_A1 The following able shore the neous penne sa he ong Po tlre eae vet 


‘he tn yor period 

el $B 201) 212 IY 2014 2015 2018 OTT JOKE INT 

Wevenment : 686 08 m6 14S 7H? Thi 185. a? oom 
Inert 008 9045 0045 05s 006 m6 0mm O05 O07 OKs 
“este hype hat avetment ere! sa by iting een i 

ove din and ondcting the relevant tes of signe 

Given the following date 

BAY} #200, af =100, X97 300, F = 100, F150, «= 27, catia he parameters 


model < Y=a+X;+% and sew the hyputheis My © B =I againet the 
© B# 15,0) 8» 13.00 p< 15 
‘Relationship between X and Yin the population v6 grven by. 

2424, + 4, Suppone, the vale of Xin the tmple of 10 uneven ae, 2 


‘mean and constant variance 
My = 0.06, oy ~ LAB, Hy = 1.02, uy = 1.3%, og = O91, wy © 1-50, wy = 
= 0.18 und mp ~~ 137. 


the least squares estimates of the regression coefficient and their wasdard 


the predicted value of ¥ for X= 12. 
data gives the production of cal and the number of wage eamers in the 


2108 210.1 211.5 2089 2074 2053 1968 1921 165.2 1768 


TLS 699.1 6974 7953 692.7 6302 6021 S310 
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0 th 


10 eens an aan of Hd ¥ te Ha 


roe, 


Bom Ee 07 stam Bie, ten the cee 
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oo 


2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 
o 2 & ss 7] om OO OO MO 


190 140 160 170 430 162 18S 65 1901S 
the linear cost function Y= a + (ix 


"AVC, MC and AC, and plot them roughly on» graph 
Investment function forthe ecooomy as « whe ix assumed to be ofthe form: 


investment, 7 ~ rate of interest 
‘sample is given = 

99 55 85 40 35 25 30 15 12 18 15 
DA 4 6 yD 
the parameters of the investment function by OLS. 

‘the statistical significane of the coefficients at 1% level of significance. 
et 2 95% confidence interval for fi. 

the value of A? and interpret the result. 

‘down the assumptions essential for each of the following tasks 


that the OLS estimators are unbiased 


regression. analynis we Study the relationship between an explained 
vaucy i Splinaery Cndpenden) aaa. ule 
Xe, bythe ttonship verween Y and momber of explanatory 
oS Ty le Tao 
—m qwantity demanded of a good and price of that 
‘income of the consumer.in fact this problem can 

‘ultiple regression analyse 
om model where there are K independent variables 
Only dependent variable, In this cave the regression 


PBX ts Dee, 


4 ty Where Fm 1,2, 5 ov 
Pas os Bye are 
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Im vectors form the shove se of equations vane writen 0 
Yeapew a2 


and is 0 const 


sna, 0) 
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oh 0 ° 
qed ° since Bin) = Ofori 2) 
sj0 OR 0 dE (x?) = 03 for alli, 1-2e 00 
0 a 
° 
® 
1» 0] ef, 


isa multivariate normal vector with mean O,,., and variance-covariance mate 


the independent variables X,, XX, ate on-stchastc or nonrandom 
nor-sochastic mati, 


Now 2° is a matrix of order (X + 1) « m and (X°X) is a matrix of order 
AD), ME the rank of X is (K ~ 1) then rank of (1X) is also (A + 1) and 
rank is (K+ 1) and | AX) ~ 0 if its rank is {K+ 1) 
EA! doesnot exist 


Method (OLS) for Estimation of 


3 i 
zo sh] =A} off, 


form the equation, 
p+; fort 1,2, cam be writen a8 Y= AB + 
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gerbe2eT op<veryler 


the result derived move clearly in erm of 8 ree variable linear repression, 


have, ber er 


Bh Eayxy |" (ayy, 
ym tt | [Ean 

explanatory variables) we get wo nema equntons and svn 
‘we can find ut the valves of ji and 

My 


ad) CE) 
= Transpove of matrix of cofactors of UX). Nom mats af cofactors 


ae [8p Edt, -ftayrs 
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= [Shes] 
teh eftnvty =2y——0) 


Bard = Ley ——— 
equations (A) and (It) by Cramer's rule 
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Hy Fi + Bala.) 

Hata) Where 44 =Xy-Fy ay =¥y Fy, not F 

FoF etna, 6 =», Fi= ¥/~thisy Ba) 


a 


ry +t, ch) 
(1) and (2) by Cramer's rule we have, 
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‘Solaon: Cations forthe estimators of the regression parame 
‘Table 3 
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nthe expression of , we eet 


here y= Ky = Ky — and <0 
pon for = 1,3.) WBC 
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‘sah. ymths ov fish covi fa ont 
replaced by ity unbiased estinatur 6 = Yel |(n- 


gable reson data on sap five peso rh 
(eal! mma 
withthe i ty ase working fr (1) 


EX Ey 
=50 =272 0 


Here n = 5 as five sets of values are given. 
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(0) We have find oat he OLS estima ii ad fi 


2a 
We kn out B= = 
oh ST od val 
We now pu eyes form the calainn ale ad ge 


Sere set Ee a 


sent, jE Baa Ha Ea, 6x10-03"° 16 


‘When jy and ye noun, ics be bund une relan 
fy FAR fof, 30-(-029 «5-554 10 
ss fy 30+ 125-38=-m18 


‘Toth OLS stints of he preter ae fy=-207, j= 128 
i) We veto find out te wae of rot net creatine! 
or explanry variables and 5 be Prt 


0025-15 tears, 
24.34 - 10 


A095 - 15+ ons = 9538 
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‘Siri, veh) = 
= yandh,)= 0.75 
tas oath) oR FF oh) 28,8, vc fi)» Nova) 


Wert oboste BE wors.ans Khas fy o10 
seh) 0.4087, vary) «025 nd cord) -0.5625 (obnainet tron 


BELA Gyyy 16x10- 


(0) We have to find ot he vale of ents) 


New enh) SP. Hae of tt ot 


‘replaced ty it waiaed etnstoe 2-2 sors 
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the model takes the form = 7 = 3 + w 
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‘estimator of Bis wn unblaned extimator i, Eq) = 9 


fe Foo} 


‘with two explanatory variables we have, 
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‘Property 2. The Dispersion matrix or variance-covariance matrix of |!" gi, 
olurayt 
rot Ry definition dipenion max or Varinace-Covariance mats 0 
by oxi) = eth pah-pr where EG) B 


Hiiy-to)? —B-ByXln-B) = Fy ~ Pr vy Po! 
Bh BoXr-h)—BOy-BYE = Ee Ba 


Fby-bibe-be) FH-BBc- Be) ~ A y-14? 
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= abl, (See 3.1 1) 
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ane 


unbiased estimator ofthe 
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ox BB+ Sem 2 B= DOH 
a 


Now, ar) =A FG) = 


tie] 
water 


Now we have 1 mlsiise veri) nabject fo the condition shat 
‘trough the choice ofthe vector C In other words we have 10 minume ° 
subj to he condtion CF =e 

For the sake of simplicity we put 03 = 
|The Lagrangian seven by, 


beh Sct-tcx-an 


n 
2 


REGRESSION MODE, 
qoexy ler 
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fe 
5 
[ BOX LY afi 11 00-0) ~iy 
i 
that under the condition that an unbiased estimator of Bo. 
‘ve var (A) is minimum when iy 


var (93) is minimum when 


(Bie) is minimum when yj 
‘that OLS estimators are the best linear unbiased estimators of 
Jie, fis the BLUE of f. This is known as the GAUSS- 


Linear regression model (Y, = fy * BX) * BaXy * He 
B i the BLUE of f) where 


te 
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ie, ee YM = MY 1 enity matrix 
= (psa MAB +H) where Afi idempotent my 
= XA IMCRB Su) for which A = 7 


YANN MR BM 
Now if we put = /—X(WAT IA" then 
PX MAB WD PMA =O 

tad hence &e=w'Mte = ¥-XOCXY 1 


elle wX RYN oT 


#/Blee)= ZB?) Blu trace moery'ay] Blu?) 03 and w 
= ng} 0} (K =i) [Mere trace X(9) EX =(K +1)} 
lee) ont ( +1), 


writen)? = thst 


‘This proves hat =H Ea e whined ein of oH 
to prior for» ear reson model wh wo exlnary varie 
Taf Bi Boda om 


0} in the Multiple Regression Model 


eBay to +f Mao em, for = 1,2, 5m, be the equation 1 
sion mode, fn valor mats for the yet if eston >> 


{prebabiiey density function) xf the 
he PAL of «for i= 1, 2. 


«fh few) 


ie 


tes nee B= Bo Br.Bielicsiyet 


ie 

joy =: Mi 
wy 

Pate 1 te meni wih pen 

et whl rpc Bvch 


‘with respect to 
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Welnew at FeAB Ye mee aD 


Peer amecr-prw— an 


Wee NY-PXT-TABBRND 
Now fink ore combi of mums will equre 


= Aav00 


oc Mepenrroxrssrm 
DUT TEND 60 [For deta see the derivation af OLS of paruonetcr 
4, 207 REND 
fe rxpea ag peutry'ry 
Soar oi pony CRYST sh ated he 18 
me nh pop 88 
0 he eM 
In re ME bt 


“Pits ehgo, EE we ge 


ar 
ae ay 
laf cra tv O18 08 


that MLE of 3's not an unbiased estimator of 3 Dut # 
unbiased estimator of 03. 


Le ie 
oe ecm cain) 


tare is an unbiased estimator of 3 


{with two explanatory variables, ic, 


MLE of o} Ze In and unbiased 
a 


a By 
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45, Expression of Multiple Correlation Coefficient iy 
General Linear Regression Model 
Let Y=BoePil+BaXa ABN HoH +4, be the rx 
whee 12a 
‘Wee is repressed om XX, Ap So the multiple creatine 
ood by the symbol, 


mit) BEM 
Fite We “Sachs 


Pua 
Bee tise 

LP rar 
Ba 


where F=f and 


rY-W 


= beta tte PD oath 
1-7 iad 
thew Faas cniery | PO?” 
b-rxrxy" 

3.6, The Multiple Coefficient of Determination R? and the 
Multiple Coefficient of Correlation In the Tavce-Varlble 
Linear Regression Model 
trove rw ove ve ha a) meme te gota 

rues vn (= 8 = BY, up = 1.2- fh tt nt e0 
Ta pocomags of eal caus dg wl Fé 
pansy ve Ti ea fet Sel en 
ib opin o nen Feed ye sn 


i ace ee Eg nan 
ia sa a ecipa  ae 


ine (F =Bo Hb Baa 612 
oe fe exited wae 1 
ecient of ae Ay) and 
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Since Exe; =0, Exe; 
where Ey (9j— fry Bae) 

= Ey B24 HEH 
=0 

which follows from the first normal 
‘equation 

‘Similarly Ex,je; = 0 
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txample 3.3. (nFlowing nape | 

(0 find the va of 

(i) Fea he fied vegrenin og 

(9 Fatowing Example 32, 

(0) finde vale 0 #2 

(0 Fi Be fd repeninn oun sn tert 

Sehition 

{0909 Prem he aleon ble of ample 1, we get = 20 
DPM Heather we beamed fly 636K, fy 806 and i, — y 


iran ban 


= MBH HK-O7T 16126316 
ee eons 
Roam 

(00 The etre (e) rence epton i 

Relay obey 

%_ f~5368 + 0506 x, 072 4, and 8 0.76 

(©) (From the clusion tbe of Example 3.2. 

MEMS Dah A Egy eS oy? 2272 

Further we cbs ji 


amportant 4 nem decreasing function 


variables or reyrewsors present in the mel. unten the 
collinear with the orher repressors , wx the numer af veyresscrs InereAneS 
ly Increases and never decreases. Seated dtTarently. an wiitional X 


merc ot creme A? 


204-FF 
“ae}, however, depends on the number of repressors present the rset 


ic is clear that as the number of X variales increases, 

[not increase) ; hence X? will increase, In view of this, (0 companne 

Is with the same dependent variable but differeing number of 
‘should be very wary of choosing the model with the highest R 

two R? terms, one must take into account the number of X variables 

odel. This can be done readily if we consider an alternative coefficient 


is Wty to decreme 


KD) 
Bi 
in the model including the intercept term, In a three varble 
Variables) linear regression model, K = 2, so m-(K + 1) = = 3 
|is known as adjusted R?, denoted by R?. The term adjusted 
of freedom (4.f) associated with the sums of squares 


‘where K = number of explanatory variables and (X * 1) 


+ 1) degrees of freedom in a model involving (K + 1) 

term and TSS = zy? has (n ~ 1) degrees of freedom. 
‘written as 

e De? | (n—(K +1)) is the residual variance, and 

or of true o} and 


1 
=s(¥,- 7)? =—l_s)2- ies 
ro FaTD? ~ sample variance of ¥ 


. 
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To faa 


a) 
ke 


ete Ge), 


a the cane of tree variable lena rgreson mene we have K = 2a 


Peo-ryet 
rom this tint cee hat for (K+ 1) > 1, JPR? which 
sume of XC acubls tenses teat cee se th uo 
JF can be sepa, about # i oevesaiy tm egaive Incase 
10 be oepative in an apleatin it vale isan em 
1 shoud be oe tain i are J and wl ot ter uch Ht» 
samples ifthe sumer of repens) ae elation tte apes 
{wil be ouch mallets Aan ean even sue naive ales, 
|B sould be interpreted as being eq 626, 
ote: Comparing Tao A values 
Tes crucial sot that a cempring Yoel othe a of te 
of determination, wheter ase oth ple 7x dhe Spe! 
‘mus be the sme te explana vibes may ake ay orm. Tht othe 9" 
Woah By Btu By em tA) 
and omagemyy rantyoy (8) 
sme gpl tem ct op The sa by ini. 
‘the proponion of Be van te dependent vreau 
‘vas. Therefore exon (AL meas he pron 
1 eaplaied by ¥, and wares quan (B) ses 
aio ia Tad hence to ar nthe see tig, hogs 
 popatonl change in, huge Yeoh 
veh )/va) 4 008 equal 1 vatogf/vartiog 
of eit ae the sae 
1. id the yale of Agus 


LINEAR REGRESSION wore 1s1 
1 0dee 
184s, 


s 
Bpete-OO EH 
ne valve of mijunted A? =H? ~ 0.59 

ee that P< Rie, the value of adjusted £ is lower than the onadjusted 


1 o4e~osa 


Brongn by Basar 
nes 


id 
& 


ete (t=R)2=b = 1-1-0994) 


9,012 = 0.988. 
12 = FP 0.908 which is sates than yandjsted 1? ~ 0.994 


1¢ of adjusted R? =F can also be obtained by uning the formule 
in=(K +) 


i 


nalyas the coefficient of correlation ris used as a measure 
between two variables X and Y 


woo) 
model [Y= Bo +PiXy+B.Xa +m), (= 1.2. 

Coefficients ‘yx, ="12 (correlation coefficient 
‘coefficient between Y and X;) and 


and 3) 
called gross or simple correlation coefficients or 
‘and computed by the formula 


weet wat eof om > 

Th gee wagon te loving orton 

ak ROI eBay Om Ma et the st 

vty mm Fo 1 hg spe tie fy 

‘oe te oe swt PB gre i, oF 

{dom af maton hemor Fad Xr of 

{My sve «ae egret ht ae! ae 

| wt we cela ice that a 

Hy Ky Sah ero oi a aa 
‘ert corte water | 


LINEAR RRGRESSION sone. 


’ wort 139 
Fare wncomeaied. 
reson rik MY be called the concent of partial determination snd my 
ns the proportion ofthe veriaion 


‘explained by the inclusion of, 


2 T not explained by the variable Xs 
‘eoeficient of determination), 


oo he rool Comceptaally 1 siete? 


Felations are found between K, simple comelation coefficients and 
‘coefficients 


‘ut earlicr that R? will not decrease if an adsitional explanatory 
into the mode, which can be seen clearly from the equation 


on a 


%, jointly is the sum of two parts = the part explained by X, alone 


‘partial regression coefficient m2 
fers of ra. ry and ry. 
‘table of Example 3.1 we have the following values 


£24, Dairy, =17, Bjy; =20 and Eey,y, =—3 
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. 2, - dae 011 
2 ihe Serer) 
' 
aegis 


yg tO ry == OID an rg = O86 


comer ee wo 
omei-019enss a7 
Seecamecaan ee 
pocan 


(6) We ow tat, 


‘Semis, 


Re 


$0137 -2<0.646 (01) 20.5% 


= ese 
= RaIN gO S158 a, 
B= 07% 
Example 2.6, Are the flowing ats cease? Give reasonk, 
0) ray = 09.745 =~ 02.73 OF 
06, roy == 09.7 = 05 
From the above dat set we wil fit cleat the yale of 


“s 


13.7. Prom the following date estimate the partial regremien coefficients 
Moor errors, wd the ljunied nd unadjonted A evince - 
sepa, Kr 402760, Ky wo 

= 660A2209, LEX ~ yy? « nant one 

= 280.000, (7, FWA, y= 2477 346 

fay Ha) 4280.900, L6Ky ~ Rie y, ~ Fy) = 4796.00 and @ = 15 
given results are related to a three variahle imnear regression nrte! 
Bo + BiXy * BXa, + fp By on 

are the partial regression coefficients. Assuming ¥-F =r 
| Xa — Fy = x we can obtain the OLS estimators of Band fy a 
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Here es mknows antec i 10 e replaced by ss wn 


a 


7 

Smee hf = rif eng ie TSS - ESS + RSS 
i=t)-oF 

ster 12-855 ita ofsbasn 

‘Stace f,~ 0.7245, j= 2.7162 and Tay 7)% 7477846 and Li,» 


+ Bi stag +Batran 


FE MAH 180g 


Ne what at 
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om 
\ 
4 Hore » 
ante aivost the sme 
ce Intervals and Hypothesis Testi in « Three 
Multiple Linear Regression Model 
JB multiple linear regression model with jwo independent variables 


By +B Nut BaXai tm. 191.20 
sifiei2 Wd Gz wre the OLS evimators of fy. hr wo} 


This is read 


Buy) where (fi) =p and van) = 


with mean fb, and variance var(j),) 
var(a)] where (fi) =. 


+28 8, cov hr) + XE van) 


= 0 and var (x) =P 


following hypothesis 


Hy Nip = 9 against the altemative 
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146 7 
‘When of is ankown a it replaced byt eb ema 


nis 


will low # drtion with 4 =(n 3) Under My By = 0. the vest va 
be 


thes the appropiate test statitic would be / 


“Hinad 


Fer he serative hyphens Hy fy #0, the mul hypothesis 
‘be mecepted at 1000% level of klgnificance if for the giver 
Hind SIS fey wn wil be rejected otherwise. [ie when /! 

{@ For the alternative hypothesis #7; fy>0, the mull hypothesis My {iy 
cepted if forthe piven sample £34 wil be rejected ster 
when > 

|W) For the akthernative hypothesis H:fly<0, the mull hypothe» H) |) 
bbe accepted if forthe given sample ~ta.-) St and will be rejecte! 


Aha em tu) eh dec conn level fv 


tens iar fy 
eg ean ttf fy 0 ef = 
‘the confidence limits to Pp would be grven by 
bot _ 168) 
be eke st 

beige 


<p By +t, 288A) 
the coniece coeficet 


et f° ait Be en 


Sa = af [in 3). Them te vet sates tecemen 1 
wg | = 0, the Yet watts wt be 


©, the mall hypetenis Mg y= © wil 
If foe the given sample ty. 51S 1514.4 5 and will be rejected 


” Wypothesis H, : B,> 0, toe mutt nypothens My , - 0 will 
Af For the given sample 10.5 and will be rejected there 


y 

Trpothei ,<B<0 wl hypothe y= 0 wil 

ei forthe given sample -ta».y and wil be react nerve 
0.01 oF 0.5) denctes the chosen level 


of interval estimation of i, at 100 a% level of significance 
would be given by, 


full hypothesis Hy : fi; ~ 0 against the alternative 
> or, Hy = fy <0 


FGs)=B2 


cs 
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Under Hy: Othe est static wold be, 


(Forte seratve pbs = #0, the mall bypotens yf>0, 
be accepted if rte given sample “9 44.) and wl ee 


thers 
(@ For the aleeative hypothesis H, => 0, the oll hypotess Hy 
‘be accep if the iver maple £5 ty and wl be rejected ot 


ep ee 
to sce fi th ives mime ~fy) $1 an wil be reed hry 
fhe when < fy 
Confidence interval for 
As regu the problem of itera estimation ofa 100 evel of sein 
rctiee ens iematinee 
Batt, 58h) 
Mtge Stns"! 


2 8H) |= Toe 


ee 


LE LINEAR REGRESSION MOOR! “7 

wnknOWN aed Min replaced by i anbiaved extwmarce 4] = et /(n— 3. 
ro Bicb fi). 

“satiate would he — wr wd 


Bmp Bay we with jet repace Se} fin 1) pscw of op} anc other 


Bye fy watt 
and wnt be reyecters 


the alternative hypothesis 1, : 5, * fb, the wull hypothesis My, = fy will 
jevepted if for the given sample #5 fy») and will be rejecned otherwise (s © 


hypothesis H; : P< By, the null hypothesis Hy y= 
dif for the given sample -ty,9-y $¢ and will be rejected otherwise 
<= tyy-3)- In each use a denotes the chosen level of significance 
val of (B, ~ By) = 
significance, the confidence limits to ( 


walt 


Ba) would be given 


Ri 
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Therefore we can we 3/5 1 establish confidence itera! foro; 
[A 10 a level significance the confidence limits 0) woul 


(nS ant (9 -$E wt 1 vase tabn fr th ew 
% hy 
“> 


i Pgystdsem tne 
oe rigso-n8te | 
va rngistceng inn 


where (I~ a) i the condense oveoest 
ample 3. The falling ale oto bsrations onthe unity Jen 
sera commodity (Pi ie (Xe §) and corsa’ nce 5 0 § 
ro SO wy wD IO MG a 
Ae SE 6 Rags 
ee ee ee 
‘Asmame 2 Viner epsioe opie of te fom 
HB My DX rm. 1= 123, 

1 Find the OLS estimators of iy By aad BL, fi. by ad fi. 
(0 Fi i and ated 22), 
Gi) Fiod sr) vata ya) 

yh Sh) x08 SEG) 

epson rls he mney fo 

aes 1, ly + Um Gnd 0% ad 9% conden els 


Calculation Table 33 
2 ¥ & Xm AhRP yak me Xa-h of aoa, ae 
1 100 «5 — 1000 » 4 200 ant ‘ais 2d - to, 
2 5 7 600 s 1 200 B 1 4000-5 (1000-200 
3. 0 6 1200 ) Oo) 400 0 0 16000 0 o 0 
4. D 6 500 -10 0) -300 0 800 90000 03000 
5 0 8 30 30 2 500 900 = 4 «(280,000 6018000 -1000 
6 6 7 40 -15 1 400 ms 1160000 1S 400 
7 9 5 1300 0 -1 00 10 = 120,000 10000800 
8 100 4 1100 » 2 300 400 4 90,000 Ww 000 000 
Ld} 110 3 1300 » 3 500 900 9 250,000 aa 18000 1500 
10 wo 9 300 20 3 500, 400 9 250,000 0 10,000 1800 
tO EY, EX EX BH Eu Bry, ye omy Bh vy Sayty 
=800 =60 =8000 =0 =0 0 =3450 =30 = 158,000 = 300 050) S900 


ove 


When jd. ae known can be ctnined fee terelatice 


Flys obo 


been 


Thee we have fy 111.7%, =-719 and fy 0914s 


oI-0602-1 ainda taer 


BAP -o807 
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; 
p01 2% 2.0 
het Rt 


308216 
72390000 ~ °° 


‘ut these values in the expression of fly and we 
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‘Sete, stiBz)= om) = VO‘000124 = 0.0111 


st sty) = [otiy) = 5553-21570 
(0) Te repent in sary frm 
Hath 70-719 +0.0143%, 

8) 2837 @39m) oI) 
0806, Ad = F-08637 


{vy We have o et the wal ype y= agai the alter 
+0. The appropriate est stainic under My ly = 0 would be 


as 


120 
He et = B= HL apy 


1 ml pas My y= 0 mi be sepa I te Be 
1S and wil be rect oth 

Whe 8 ns gn =y = 238 

‘sy = 2345 ale abe) 


=1-005- 055, 


LINEAR REGRESSION stones, 153 
canine Heals off, woud be 
fot yi rstthor 


fp thns5R(0) 
104 3499 = 21570 
we Migs 
9% 396 and 194.1714 
1 29 egotidence intervals Off, are 292286 ad 194.1714 
Hest the mull hypothesis My : (5, = 0 against the alternative My: fy * 0 thE 


under Hy =By = 0 would be ¢ (ober) = sia 
EH) 


£1) ~ 0 il be accepted if for the given sample “fj »-» $ (observed) 
wil be rejected otherwise, 
ofa f0ns.00-9) *fasas.2= 2.365 


= ODI, fyn-9 = omn57= 3.499 


cua) = 2104 dos ot icin te interval 2.365 and 2.35 
j= Os ejected at 5% level of significance. ut «(observed)» 2.804 
und 3499 and ence My: fy =O is accepted at 1% level of 


Tiesits to f, would be 


a Hy #0 the 


a 
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nein = Boo 


Ay y= 0 wil escaped i fre given spe 
wi Be scl therm 

When a= 895, Syn "arn? = 2.305 

sat when = 1, is "Mar 409 


ae eae 
‘which thes within both the imtervaly 2.165 No 2.365 and 3.499 w 1448 5. 
7» 0 m sccepted both at 5% and 1% levels of significance and hence ines 
‘Now 100 (1 aifh coaidenrelntrvals of fly woul! be 
Bt 9 st) 
when @ = 00S, ten 83% soatdence itera ff would be 
B:2har S00) 
& 0014822545 «00 
00143 20.0262, -10077 wd 80408. 
So, 99% confidence intervals Of are -0.0077 and 0.0405. 
When = 00. then 10 (1 = P= $m cahdenn meray wou 
Br hoes7 Sh) 
c 0014323499 0.0111 


05, yfn-3 * f0105.7 = 365 
(B=OD1 bis =may~ 3.49 


pe ee (OSE) = 2526 fre ny 


== Bs rejected and He 0 he ierval -2 65 wo 2.36 


By * By im acepted at 3% 


Imerals ofp, 
Bh) tty. 1 S86, -j,) 
TOD (= 80% 99% condone intervals of, p) would be 
B= ons SEU, jy) 


8) ae = 13.2526 end 1.176 
100 (1% = 99% confidence itervas of (By ~ B) would 


of (Bh =) ace 16.1232 and 1.7146 
9946 confidence intervals of 0} 
Iitervals of 3 would be given by 


‘where 1° values are taken ftom the table 


4 
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‘$6, 98% confide itera of 2 would be 


ea ; 


$224. 255224 | 16,57 
SSM ares 


So, 99% confidence intervals of ol would be 
ta gh nm Be ag. eos 
ee 


3.10, Analysis of Variance: 
— (A20V4 na tiple Linear 


‘Yet aotber tm thats often peesented in conan withthe tice vari ies 
Toprenice model te anys fwartance. This phe break dw of he == 
cee Tie (5) el a a 

‘The sited ee vale was reson Ln wet he repression 0 

Bo +PiXy +Boa riot 1,2 mh is given by Hh =fiy oft fhe, a0 


tad 
om me we be 
Bak BN fas Bak 
Bata Fa 
eee and tay Fs 
=n Baty 586 
+P 


Se 


io} a there ar tree 


‘Gut whether the explanatory variables (X, and X;) do 
influence on the dependent viable Y. Formally the test 
‘Of the regresion implies tating the null hypothesis 


titemative hypothesis 1, -not al are zero, We ray 


= 


~3 (Hee K=3) 


wih the table vale of F with df = 2, 0-3. 10 


a. 


pee TRODUCTION 70 B00% 4p 
158 Mt 


He Fe Facag-s we aceept the ml hypothesis that i WE ACEP tate, 


Thee an namae reasons between the coefficient of mullite deter, 
12 tad the Fest wed inthe analysis of variance. Assuming the normal di 
fir te dnrances wand the ell hypotbesis tat y= BB ~ 0, We soo 
ESS/(K-1)__ESS 
P= RSSTin“) BSG 
in dinette a he Fdistebation with df= 2 20d (n 3) [Here K = 386 thee, 
‘thre pacar ncading the conan cere tem Py 4 thee vai" 
‘eqression model 


ESS/(K—1) 
RS) 


Now we can write =F = 


rmamber of parameter inthe near regression 
my explanatory Yale, When = 0,70 Te 


Bi, = 0 is equivalent to tet 
eam he 
seems oan) #200. TBe ANOVA ile cana Be wie, 


iy Br A Dy re not ver 


ln vesting the mall ypoehenis H,  B, ~ 0 aguines Hy 9 + 0 and Hy 


mae 
Say seh) 
Hest Ho |B, = Bs 9 Wosnty) agri My and fy are sot ver 


lt F Matiatie and ANOVA. The calculations 


[ANOVA TABLE relating to demand for » commodity 
Table 3 


gst BO WE wd wane 1 


Observed Tabolated 


with a 
= KA) (K)| Foo 


[with df (K — 1) ~2 and (n ~ K) ~ 7] 


PF with Af: (K — 1) =2 and (n- K) = 7. From the 
3. 1A and Fy. 29 ~ 955 
) = F* = 29.72 snd Fg 9-774 
we 
the mull hypothesis Hy: =f = 0 wil be rejected 


= 9.55. This means that at 1% level of 
= 0 will be rejected for the given sample, 
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OMe 


Thos both at 1% and 5M levels of significance We mY cain shy, 


coefficients ofthe regression equation are not zero. 
It should be noted that we ean also construct the ANOVA table 


Following Example 38 we are showing the ANOVA table ew in ey 8 


ANOVA TABLE in terms of 


Isaa.is 


2 1.3, 92-3480 and £2 = 0.994, 
Foasiny= 774 and P= 29:2 > F,. 


That hema bypass, : 9, ~ 5, 


= O's rected both at 1% and 5% vel of 


ing the O18 estimators of the 
(Cots Douglas production function 
By He (Paral) elawicity oF our win 1a 

With respect of labour input i 


(8) * Ba) wives information stout the returns to scale. The fonction 
TRS, CRS and DRS according us 


A production function is specified 


* Bye 
% Box -xB]u, where | ~ 


= labour input, X3 = capital inpus, W ~ 
The corresponding Log.-t 


Stochastic disturbance term, 
‘Linear form of the production function is given 


By log Xy, +82 log Xz, +logu, 
ing +My, 4 ~N(0.02) 
size Of 23 the following results are given: = 40, 
TSS=10, var(@)~ 0.6084 , yangi,) = varfi) = 0.0108 
ression equation, 


for a, B,, B, and o? 
.O and f= 0 separately at the 5% significance level. 
‘equation can be written as 
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(Te vac of mile cnet of erin gen by #2 
there SS 1S8-RSS= 10 14=86 P= 086, 

iy We know hat ea = fa = SRA 
sini, ey) = fot = VOOTDS ~0.02 


tnd eh) = fai) = VOOR 0,102 
(9) We have to id ot the value of OLS estimator of oZ ie, 2 


(©) We have to find oot he 8% eoudeneinerra fr 2,f and g 
sing the bon wit d= 93) = 23-3) = 20, we can vr coy, 
ontienc itera foro By, and iy 
For: Gy asnSBi 
237, 5.65) ty,9-y = arss0= 20R6, 6m = 23,0 005, 
For Pa bith uesoSti);)=07=2046-0302 
=07 021-048, 091) 
tow for a Bo Afens2y SEI) =02220960 0502 
=022021-(001,041) 
99% confidence laevis feo} wind be 


) 
503 <0 
Fal 


4-2 aon 
setae 001, 005-098 


LINEAR REGRESSION yeyyp), 


est the null hypothesis 17, 9 
state under the fh 


he alternative Hy: fy # 1a the 


policy 
gonerved) = “spy tes 


Ce 


SH) 0.162 ” TGqqy~ 2940 


BP, 10 will be nccepied if forthe given sample 


_and will be rejected otherwise 
0.05, "y,-1 ™ "0005.0 © 2.086 
we see that ¢ (observed) = — 2.94) 
lie in the interval ~19 035.29 = -2.086 and foars.zo * 2OR6 and hence the 


£ y= 1.0 is rejected at $% tevel of 
js Nevel of significance 
Bull hypothesis Hy: fb, = 0 against the alternative By + 0, 
statistic under Hy? f, ~ 0 would be 
0 


02 


ill be accepted if for the given sample 1, , <1 (observed) 


d) = 1.960 lies in the interval -2.086 and 2.086 and hence 
level of significance. 


in the Multiple (Three-Variable) 


multiple regression are similar those in Une ease 
‘Tegression, except that to compute the standard 
the variances and covariances of all regression 


rid c 
of the value ¥p of ¥, given values Xiq of X, and Xyy 
1 be values at some future date. 


‘. 


AX INTRONUCTION TO BOONE, 


Seed AY My) Esco 
eee no me 

‘Tean te prior i be sabia 

The variance of the penton | prntiction cover giver Py 

Athere wat) = tl, ~ Bc = a =)? = Fla = 9 (6) 

mtrvetntontiet lta A 


+ My Hietyfe onthe FoF ve 


ster Serr eed wig : 
SEN ES eg 


whe wets seta Ty 
Ween of a hae ele yf ned itr = 22 
‘n= sample size. Thus 100(1 ~ a) % confidence immer forthe prediction ¥osh! 


fests Mtl) ate Style 


LINEAR REGRESSION MODI 165 


HAE 5224, X= 6K, «ton and « = 10. We wow put thee valves 
the expression of the varie of the prediction 


smthy)= varie) =A (16) octe Fy ow hy 


BRi9 — HL — Lye yb) +t — Ray ow hy mt wt 


( p=s22a{te b)-c00 6 655 


10 61400 ~ 00) = 0.0045 » (1400. ROMY? « 0.000124 

Nig ™ 10 wed Xzo = 1400, which are given) 

AAT (AY 689 +2 « 4 + 400 « 0245 + (HO)? » 0.000134 
+1176» Aho 


{ SE(%) = Yvartly) = J324504 ~ 18.014 


of the prediction is 324.504 and standard error of predic 


‘that (1 = a) 100% confidence interval for prediction would be. 
SECT) when a = 0.08, 50-3 tomns 0-3)" fouss.y™ 2365 
interval for prediction would be 

“SESy) 
$5 * 18.014 or, 58.56 + 42.60 or, (15.96, 101,16) 
interval for predi would be 15.96 and 101.16. 


Analysis in Presence of Qualitative (Dummy) 


{four types of variables that one generally encounters 
are ; ratio scale, interval scale, ordinal scale and nominal 
used in earlier sections were essentially ratio scale. In 
that may involve not only ratio scale variables but 
‘variables are known as indicator variables, categorical 

‘or, dummy variables (Binary variables). 


dependent variable o regressand is frequently influenced 
{exg.. income, output, prices, costs, height, weight, 

ob Pee we cosealy qualitative of nominal scale, 
nationality, geographical region, political 

For example, holding all other factors constant, female 

‘than their male counterparts or non white workers are 


> 
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‘found 10 earn teva than whiten Thin pattern may reall fn 
wt whaeve the etn, gniate vanes uch 
the tegressand ceatty shoud Ne wooed among the esp 
repenors 
ince su vale may mica te prevent abe 
ashe, ch oma fal, Mask ot whee which ae 
‘Variables, one way we could ‘quantify’ wach attntutes is by con 
‘stabs that ah abs UF 0, eating te re 
of Gat eben 0 eating the abet of hat abet 
nto hat pon is 8 ead O my designate a ferle oF! m8) 
4 pen ote, ad hatte pe ia, aa oo Dry 
be incompownted Hh regrensnon ewiels just an eanily ax quantitative vrai 
‘ate ft repent ay cota regres ia ar ale! 
(nhac Sach models wre alo! ANA f variance ANOVA) models 
LARA. Use of Oummy Variables 
The dummy varies ve wd eet the egresin anys 
ails can be wed for sever purpsce We ae explaining some of 2% 5 
dummy variables in applied econcen research 
(0) Oumy variables as prosies to Qualitative (Categorical) Factors 
‘Dummy veriahles are commonly used as proxion for qualitative 
‘Profersion, religion, vex, region etc. For example we consider a sample of family 


‘rom al repions wf the cowetry, rural and urtan and we want fo estimate the 
{for tebwoce manufacrres asa function of come. Its kno that town eclc™ 
Inewvier wokers than raral farmers. Thu, ‘epion”ian portant explanatory 1%! 
‘this case. We may represent this ctr by a darmmy varable to which 6 0! 

sign the vale | fora own veer and 0 fora peron ving i 4 ual 


sod, ~ dunany variable fr region. We put =1 for 2063 
for a pen living in ural ara. 


be used as proxies for quantitative fairs, when 
‘are wailable or when i is canenientt9 do 89 0 
ease saving function $= F(2) fom x crs 
age’ is an important explanatory factor t= 
a commanity, since people become mre ti 
factor, we may approximate it by a dunn: 
(ow ABE groups: 


the dams 
up Lan 


ton for an ecomomy 
‘World Wars (1914-1918 and 
conditions prevailing in 


shift we may use a dummy 
above ‘abnormal’ years and 


function takes the form 
Bo + Bi%i + Baz, +m, > 0) 
‘consumption, ¥ = income, 
ty Variable for the shift of the function 
‘year the estimated form of the Consumption function would be 
BY +Ba = by +i) fr 


finctions we can clearly sce the shift of the consumption 


ion) years. 


Fig. 3.1. 
ction (Fig. 3.1) ie, MPC is assumed to be the 
jPeriods and hence the two regression lines are 
slope remaining the same). 


£ 


Os 
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{bunny vals are wed for messing the change of param ig, 
ovo te 

Tes wan hat ove lng prio of tie or in sboomal (Wao dee, 
ot nly do te fncns i (i ost eet changes bss en 
{any wel be explo change = ates and propensities charge ov in, 
Change in he prt fo fton may be apr by otc 
hmm variables ne con. 

‘We can write here the consumption function in the form 

G WAY faze +PiZa 
where | C = consumption, Y= income 


[0 esol ean (hen, = 0) 
27 ¥2,~Dunmy wie {tegen 2 

may x ml pl im map Sti 

tat sr wie ra ara ert eine ann 


a 

4 this ca bot spe ad interep of he function wil change 

| Duman variables are wed as proiesfor the dependent varible 

i some cases, the dependent varble of enc my b x dua vile 

For eampl,soppose ws watts weware he dateminat of caraway! 
ample Sore people will have car while ther wil a Supp 

detent: of de owner depend on ecm td pean 


LINEAR REGHESSION M409) 


1 in the third quarter 
0 in all other quarners 


Boted thet we cannot introduce here « fourth dumnmy variable (with 
fourth quaries nnd zero foe alt other quar) becaune the determinuat 
sums of aquares and sums of products ofthe explanstory variables 
dummies) would be zero. This is due to the dummy variable X, 
sg ulues equal to 1 in all periods and is associated with the 


introduce a large number of dummy variables into the model, we 
‘estimators of the parameters. In such cases (XX) mutrix may 
May not exist. This problem is called Dummy variable trap. 


te is in the Eastem region of India 
(is. in other region ofthe country 


sin the North-West-Central region of the counte 
(Le. in other region of the country) 
“india in 2006-07 the following results are obtained by 


241.04 D,, 30.09 D,, 
( (129.50) 


“Ast (023) 


ys 


ON 
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"The repremion results show, the mean percapita consumption experi: 
South about & 1097.38, i the astern region the pereapts consumption» 
bout € 24104 and tha in the North-West-Cenral region, iti lower abut? 

ample 2.13, We consider a model to show the determinants of Le 
in) neo 19 States of adi, 2006-07. 

The model takes the form 

Joy Dy +0, +4 
‘whave 1, = Meracy rate (pereent) 


‘aig dat for 9 States fini for 2006.07 fe lowing revs were 
OLS at 
4H o7882-1632,-16000,, 
SE: 2) Qt ay 
+ FAN) 27 8D 
1 this regression model there are two dummy vatiables. The regression 1. 
show thatthe mean ery ty abut 32 perc, Copa wit = 
‘Mercy ae fi fol ee by abut 1638 pet, or anata es 
te of 582-1633) $850 eco 
"By coma fhe wh veh whe ae th mean ery as y+ 
out 16 pent, fr a cial eae Urea af (82 +16) = 9242 se 
ample 4.14. This example shows rerison wih mine of quanti: 
‘tae reat Ne mi ew 
“ave mein genes per pron a3 


7 Ce a 


LINEAR REGRESSION MODE. 7” 
Qualitative Response Regression 


Whereas the explanatory variables are ether quantitative, qualitative (ot 
‘a mixture there of 
‘we may have to consider several merdelsm which the regreseand elf 8 
ve An nature. The qualitative response regression mordels are now wereaninaly 
‘reas of social sciences and medical research 
‘We like to study the labour force participation (LFP) deca 
an adult is either inthe labour force or not, LP is # yes oF no de 
response variable or regressand, can take only two values, say, 1 if che 
the labour fore and 0 if he'sbe is not. In other words, the regrensand i 
dichotomous variable. 
Togression models where the regressand, Y is qualiteive 
No find the probability of something happening say. E (Y/Xyp Xp 
response regression models are fen known ax probability models 
four approaches to developing a probability model for & binary response 
fe the rogressand itself is qualitative in nature, These are 


‘comparative simplicity, and because it can be estimated by ordinary 
[we first consider the linear probability model (LPM) 


inear regression modet but because the regressand 
ty model (LPM). This is because the conditional 


+ can be interpreted as the conditional probability 


¥/ 
J 


fs, PO 1X). Thus, in our example £\"//y ) 
y owning a house -and whose income is the given amount 


LPM for models like equation (1) can be seen as 
0, as usual we obtain 


Q) 


| 
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uw py = pebabiity tht J; = 1 (hati, the event occurs), and ( 
probably tba Y= 0 (hat i, te event does wot ecu), the variable 


Fallowing probabiny dation 
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Ts tows a, tes» ert probbay tbuton 

Nev by steno eam pram 

ee Eee 

iw comparing psn 2) ch eatin 

vecam quate M4) = 0 BY py) 

TD ft the conn! pail fF Sine epost me 
termer® and, we nee macho 


one(%) o 

es eh ee en at 05 ce ly ded whan 
cedar aestintal yey happend 
PRR ierrer toe os inl yee rect ons se 


(0 Non Meaty of he Oitubanes 
OLS oes nt gue rcs) 8 nny dab» 


be so dtnbaid for the parse Of statistical infrece, Het to 
(Of pomaliy for, 00% tenable for the LPM because, like te 


tanuened | 
SmnesE i etme ten a ey fo Sc 
This un be scen clearly if we write equation my, 

be sen chet fw C= = B®) 
“The probably ditibaton of, mt 


iaiance of TNE eFT0F Term bn he LIM ie heteroncedueic 


per that im presence of heteroscedaticity the CLS sutimatory are anbvaset 
fclent (1. they 40 004 have minimom variance) We kaw that there are 
fof handling the heteroncedantiity problem Since the variance of # 


FL) a we ere Sart poten 
pe anodel (1) by dividing it thorough by 


(0 fi), the estimate of w, 


(%f,). Tren bain 


the estimated Ww, to transform the data shown in equation (9) 
equation by OLS (i.c., weighted least squares), 


‘measures the conditional probability of the event Y 
ily ie between 0 an 1. Although this is true @ prior, 


he estimators of & {"/,) will necesaity fi his 


Ly ‘with the OLS estimation of the LPM, This 
ki SEY) 1. There 


D>. 
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are two ways of fining ov whether tbe etimated J, Me Hetween 0 an! | Une 


cota the LPM by the wal OLS metho an fin! out wheter hein 
terween 0 sad I. I some ae fs than 0 (he, sept, awed 9 
‘one cass if hey re renter tha hey ar sumed Be 1 The sc 
Ue evn on eating techs tha wil guraiethat the ent 
wots fil he heween 0 and 1 However, in log and grb rs 
otmated robabties wil ned Be between the logic lnits and 

(1) Questionable vlve of K? asa Measure of Goodness of it 

‘The conventionally computed Ris of limited val in the linear pray, si 
omenpoding Yo gvenX. tr 0 oI. Therefore al the T sales wi eh 
song the X mis (when Y= 0} or say the line creping 10 1 (when Y= 

‘Asa res, he conventionally computed i tly 10 BE mich met a 

1m moat practical applications the ranges between 0.21006 Howeve. 0.» 
‘seu when pd en wc he | 

Exam faliowing able (able 38) ives vented ts bs 
‘ownership ¥ (1 = ower touse, = das mit owe & out) and fan ns 1 
(@omands of das or 40 files 


J INPA REGRESSION Mo 


ip Tn fr sey X= 12 1200, etd ray fing 


ex 12) = 09457 + 12+ 04001-02705 


rohit Ia Ly With as come of $3 


We can easily etme the probaties J, for var lv 


ha probabilities fr al gven ews of com or lames, seme 

Wil he egsive a nme ni exceed | 
Saban X= H(i} =H) = -09457 «8 01021 - 0.1209 

= 0, ( X= 20)-—09487 30 «0024 

iat though (7! X) postive a em tan | ir estimate. 

ose O et than 1. Thine ren a LPM et 

When the depenicat varale casa wn hi 

Yar al pontive and lv han, be 1PM may water ro he 

IT in se ten we hve to soy WLS tw esti the 


EXERCISE 


thou nox lai incu regression mode (CLAD wher te 

for Y= Do * Bek * Bak * 4p (= 1.2 

ee tereion wel ofthe form T= By * BX, Boks He! 
You etimte te mpewice partes by OLS mod” 


Vinx reqreion motel 

Btoperies ofthe last squres eximitor rating to x thee vale 
(CLAN 

rable laine mgresicn model, Y,~ Ry > Pri, * PX 
the estimate parameters cuties ae aha’ 

fk my = 12, expen he estimated repression 

tenn of variances and cvefcient of corelaions 

‘aa evariaces f the repression parce in the model 

poe #12, 

‘THEOREM in tems of thee variable linear 

B= Be * BX Baba Hy f= 1 Deo 
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0, What sent by the em LUE ? Show tat i he cane of he a 
segranon model (7, By Py * Be a) fis the LUT 


“tel 


How en you etme he ace an tne 7 4 0% 
FB Bay Baby apf 12 0? 


‘tn ter of 4 hee yaa ie epee mel hw tht 


wlan eta f the irae ofthe enn dice 


3, What asim tt ere (LE) 1 I es fe SWI og 
‘egraon del 7, = By © YN, + Bly = ap 1 12. 8 tnt 


ivi rh 1 ML ta 
14 Show tt i te wee tar meno kl MLL of of = 2) 


st oma Si 9 
15 echoes =f yh 


Mid ae Msi) fi M(-v0 


IB =A) etn ae propria CLAM 
etn of vere one viele 
ofthe tert pte’? How cn you meron «He 


1 Hee yo ean ANNA. ae 
ecto, fans fo 6 ene HS 
Ya tp, tip = cpl as 


eyo wchosiyl puree fie cas yu ema 
fapbg OLS weed ? Wt snd eB. By an 8 * 
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sei talc the ey 
aia i a seo ta 
ete eagle we ery vit 
ep aera rom 
38, What are the dummy variables * Comstrwet « ede! where dumm) varieh'e 
Perse plats Fan or 
38 What do you mean by quilitaive yxponse regression model ? (ive Pl ny 
‘o-bapsterdplereprinaryipetenpaanial 
(Rs Aees tw ed fon Mn of ctereies 7. 
DY = 20, LK) = M8 24, = 49, EF? = wa, LAP 92, Ley = 169, 27" 
SRS Im hala mmc ot cf ct hm 
Popeye eer 
‘1 Consider the ftowing regres model i deviation form 9) = fy P48 
sample me = 100, AMD, Posy fed, tayo 
ns 
haa EY iain it bw 
{i Toot the hypotheis My © By ~ 7 spit By + 7 
(eslpcorent- erp a et 
cies Th een em 
eae seal teeta p+ ni, 95 
‘og output, Xt labor tp, and X= kg capital input, The date refer 10» 
eee Senate bose me 
BPR Dyrye® Hh atz Dyy 21 tayy =k 97-10 
( Eationate Py. By aod Meir warterd errors. 
(i) Find A and adjusted 2° 
18 meee 
aoe aE 
feat ces fa ar a fe? 
leo ohhh i heat 
het S aoe 
yma 
nu 2 OH BE DH 


eo 1, = By B+ Baty =a, Pd the lt us 
bec ap 
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ftiewing er wee ohne from mpl of fom Se ap 7 tr 
UR) fd capa opr I) manent = whey om 
OL ay aagine, LG m0m £44 ur 
AIO, A =976, EEK ATH 

he eat sures onion of Yo K, aa, Whe 9 Ae momen mein 
entice 


1 abircy te wd the pre ade of mga pt ewe the 
for cea cry 

2010 2011 2012 2015 2016 2018 2016 

m4 us ow Hw Ww 

305 258 354 324 970 975 aed 145 

WE 160 12K 149 4s 150 40 11s 5s 157 


Pal + 


> GRO 
29 MO 480 
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4h 510 390 wo 0 
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00 9 980 M0 
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of sesiicance of te parameter eimates st 
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42, The following able shows the price index of durables, the average yey ming, 
‘expenditure on donbles of a “typical” household of « country 


Year: ‘2010 2011 2012 2013 2014 2915 2016 2017 2018 
Expenditare on 

unbles (Nin 1N5 110 115 120. 140 100 105% Hs 
Household 


income (¥)) E= 1885 2000 2010 2040 2275 2255. 1995 1905 2955 
Prive Index (%3): 100 102 9S 95 98 10 110112115 ay 
(0) Fit n regression line wo the function © Y= fy. + BM + 
(0) Tet your results by wsing the Analysy of variance table 

4, The flowing le shows the evesunytion of tbacco manufactures, coast, 
and the price of thacco manures for France dung 1950 


Yer Contunptoe Taco Trice of wha 
to) D (ition fscs) ¥___ancs pe a 


mas 
119.00 ants 
129,200 34 
3400 3820 
139400 3470 


1938 700 150.90 x 
1959 suse 13.000 on 
Fi liner segsion Dye Poly ox and = om ew fancy of 
fensint elacty ope B= pgp 
spine ming he aa of yarn ale 
the fle ad fame eis fe 40 fiw 
cpm Y= =i Psts+® expla how yoo wilt 
henner 
set TRy si Pao whe 4 ~ 8h 
=o 


os arth. va) and cong 3) 


Tra eh way fem, 
in By seo a 55 er of sicnce 


he epson resus by axing Analyis of varioce table 
‘he reestion rect by using Analysis of variance able 


20,000 25,000 28,000 30,000 36,000 40,000 
40,000 45,000 50,000 55,000 60,000 70,000 
a 5 6 » 7 w 
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Asvume u linear regen equation ofthe form Y) =f +, fh 


= 01f 40 year of ee 


od woume Ky wx dammy vale, = [21 


40 year of age 


(0 Fin fy. ym i 
(W) Find AP wd acjusted 
(ii) Write the eugrension renul im the summary form 


(Go) Toot wheter fy hw iy re siicnt or not a S86 level of sen 
(9) Conroe 99% coniewce itera of «2 


Y= M7 ~090 4, +043 5S, + 688 5, 289 5, f= 008 
07 ©) 037 oH a 
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‘aan al lee be rea 
4, You ate ve he lowing repmnioe res 
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131) 40) 
eer —3002 4 +285, = 0776 
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eg @ = 430-134 og P9017 oY 
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Dy - Ty = 20.000, (7, FHA y = yy 7477 
BY, THN, —,) = 290.900, Ly Hyg Fa)~ 479600, # = 19 
GW A pomibie 10 obtain the following from + set of date” 
ry = 08, ny = -02, 7 = 08 
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4.9, Some Itlustrative Examples 
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where yey = 0.99, gy = 0.8 
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221, The following table shows the values of expenditure on clothing (7), tal expentitune 
() and the price of clothing (5) 
1960 1961 1962 1963 1964 1965 196 1907 Ime HOW 


% 16783 Gale 87° 3s 
} x 1s 2 0 2 0 6 2 Oe 
: Y ee Se ee 


(Estimate the model - % = Py + BX +s, * 
(ii) Estimate the model = Y= 0 + 4)%y * % 
(ii) 10 Y= Bo + PuXy + BaXa +m isthe tre model, then examine the commeguences we 

the regression parameters when X; is omitted from the model, 

22, The following results were obtained from a sample of size 12. 

‘EY, = 753, Ly? = 48,139, LYy¥; = 40830 

BXy = 643, EX] = 34843, Dg T, = 6796 

BX) 106, Ey) = 976, EXyy = 5,779 

() Estimate the model : ¥,=By + PX + PaXs) + 

(ii) Estimate the model : ¥,= 09 +0,%, +», 

(ii) Examine the impact on the regression parameters when X, is omiticd fom 
model Y=Bo +PiXiy +Pa%s +m - 
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APPENDIX 
STATISTICAL TABLES: 
TABLET 
ORDINATES AND AIGA OF THE DISTRIBUTION OF 
STANDARD NORMAL VARIABLE* 
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